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1 PLAG GENE FAMILY AND TUMORI GENES IS 

The present invention relates to the identification of 
the PLAG gene family as a family of genes frequently 
5 associated with tumor igenesis . The invention in particular 
relates to the identification of one member of this gene 
family that is involved in benign tumors such as those with 
chromosome anomalies involving a particular region of the 
long arm of chromosome 8, for instance but not restricted to 

10 pleomorphic adenomas of the salivary glands with involvement 
of chromosome 8ql2 and a translocation partner chromosome, 
very frequently chromosome 3 [i.e. t (3 ;8) (p21;ql2) ] . In 
addition, the invention relates to another member of this 
novel family as the gene whose expression is frequently 

15 abrogated in malignant tumors such as for example but not 
restricted to malignant salivary gland tumors. Furthermore, 
the invention concerns the identification of the CTNNB1 gene 
as a prototype of tumor-specific breakpoint region genes and 
frequent fusion partner of the PLAG genes. The invention 

2 0 relates in particular to the use of the members of the PLAG 
gene family and corresponding fusion partner genes as well 
as their derivatives in diagnosis and therapy. 

Multiple independent cytogenetic studies have firmly 
implicated chromosome region 8qll-13 in the following tumor 

25 types: pleomorphic adenomas of the salivary glands (high 
frequency) , pleomorphic adenomas of the lacrimal glands 
(high frequency) , lipoblastomas (high frequency) , solitary 
lipomas (low frequency), rhabdomyosarcomas (low frequency), 
and renal cell carcinomas (low frequency). As an 

30 illustration, we here focus on pleomorphic adenomas of the 
salivary gland. This type of tumor constitutes a benign 
epithelial tumor that originates from the major and minor 
salivary glands. It is the most common type of salivary 
gland tumor and accounts for almost 50% of all neoplasms in 

35 these organs; 85% of the tumors are found in the parotid 
gland, 10% in the minor salivary glands and 5% in the 
submandibular gland. About 50% of these adenomas appear to 
: have a normal karyotype but cytogenetic studies have also 
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revealed recurrent specific chromosome anomalies. Frequently 
observed anomalies include aberrations of chromosome 8, 
usually involving the 8ql2 region with as most common 
aberration a t (3 ;8) (p21;ql2) , and aberrations of chromosome 
5 12, usually translocations involving region 12ql3-ql5. Non- 
recurrent clonal chromosome abnormalities have also been 
reported. The highly specific pattern of chromosome 
rearrangements with consistent breakpoints at 8ql2 and 
12ql3-ql5 suggests that these chromosomal regions harbour 
10 genes that might be implicated in the development of these 
tumors . 

In previous physical mapping studies, the chromosome 
8ql2 breakpoint in pleomorphic adenoma of the salivary 
glands was mapped between MPS and PENK. In an effort to 
15 molecularly clone the gene affected by the chromosome 8ql2 
aberrations in the various tumors, the present inventors 
chose directional chromosome walking as a structural appro- 
ach to define the DNA region encompassing these breakpoints. 

As a starting point for chromosome walking, the MPS 

2 0 locus was used. During these walking studies, YAC clones 

corresponding to eight different loci in. 8qll-12 were isola- 
ted and mapped by FISH analysis. Initially, the breakpoints 
were mapped within a 1 Mb region flanked by MPS proximal ly 
and by the genetic marker D8S166 distally. One YAC (CEPH 

25 166F4) within this region was shown to span the t(3;8) 

breakpoint in two tumors. Subsequently, it was shown by FISH 
that the chromosomal breakpoints as present in a number of 
primary pleomorphic adenomas were clustered within a small 
chromosomal segment which has been designated pleomorphic 

30 adenoma breakpoint cluster region (PA-BCR) . 

It has thus been found that essentially all breakpoints 
of chromosome 8ql2 map in a region indicated herein as PA- 
BCR. Further research revealed that in this region a member 
of the PLAG gene family, the PLAG1 gene, can be identified 

3 5 as the breakpoint target gene and postulated tumor aberrant 

growth gene. The PLAG gene family is a subclass of the zinc 
finger gene superfamily. The CTNNB1 gene was identified as 
the fusion partner gene of PLAG1 . Molecular analysis of the 
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chromosome breakpoints revealed that the chromosome translo- 
cations resulted in gene fusions. Fusions appeared to occur 
in the 5 • -non-coding regions of both genes (PLAG1 and CTNN— 
Bl) , exchanging regulatory control elements while preserving 
5 the coding sequences. Due to the t (3 ;8) (p21;ql2) , PLAG1 is 
activated and expression of CTNNB1 is down-regulated. Acti- 
vation of PLAG1 was also observed in adenomas with variant 
translocations, i.e. t(5;8) and t(8;15). The results of the 
present inventors indicate that PLAG1 activation due to 

10 promoter swapping is a crucial event in salivary gland 
tumorigenesis . 

The preferential fusion partner of PLAG1 is the CTNNB1 
gene, which encodes B-catenin, a cytoplasmic protein of 
about 88 kD. Its frequent involvement in pleomorphic adeno- 

15 mas point towards a critical role of the CTNNB1 gene. The 

major role of CTNNB1 may be to provide an active promoter in 
front of the PIAG1 gene. However, since the observed promo- 
ter swapping between PLAG1 and CTNNB1 leads to down-regula- 
tion of CTNNB1 expression, it may also lead to other physio- 

20 logical consequences, especially since B-catenin is a pro- 
tein that has been implicated in highly "diverse processes, 
such as human colon cancer, epithelial cell adhesion, embry- 
onal axis formation in Xenopus . and pattern formation in 
Drosophila . 

2 5 According to the present invention the aberrations in 

the PLAG3, gene on chromosome 8 and the CTNNB1 gene on chro- 
mosome 3 have been used as a model to reveal the more gene- 
ral concept of the involvement of members of these gene 
families in tumorigenesis. Although both gene families are 

30 known per se, up till the present invention the correlation 
between these families and tumor inducing chromosome aberra- 
tions, like translocations, deletions, insertions and inver- 
sions, has not been anticipated. Furthermore, until now, it 
was not previously demonstrated that alterations in the 

35 physiological expression level of the members of the gene 
family are probably also implicated in tumor development. 
: According to the invention, it was demonstrated that in 
normal adult cells the expression level of the PLAG1 gene is 



practically undetectable, whereas in aberrantly growing 
cells, the xpression level is significantly increased. It 
appears that PLAG1 is a oncogene. 

The research that led to the present invention 
5 indicates that it is probable that abberations in the PLAG1 
gene, either in its sequence and/or its expression, will 
usually lead to a benign tumor. Furthermore, preliminary 
research indicated that expression of PLAG1 is increased in 
various types of malignant salivary gland tumors. 

10 Another member of the PLAG gene family is the PLAG2 

gene. The present inventors have identified this gene, 
determined its nucleotide sequence and predicted its amino 
acid sequence. It was found that PLAG2 mapped to a region 
frequently deleted in malignant salivary gland tumors 

15 (6q24) . Thus, contrary to PLAG1, PLAG2 may be a 
tumorsuppressor gene. 

The present invention now provides for a tool to 
investigate these theories and may ultimately lead to a 
means for distinguishing between benign and malignant 

20 tumors. This knowledge can then lead to more efficient 

methods of treatment. Thus, the present invention now provi- 
des for the members of the gene families or derivatives 
thereof in isolated form and their use in diagnostic and 
therapeutic applications- Furthermore, the knowledge on the 

25 location and nucleotide sequence of the genes may be used to 
study their rearrangements or expression and to identify a 
possible increase or decrease in their expression level and 
the effects thereof on cell growth. Based on this informati- 
on diagnostic tests or therapeutic treatments may be desig- 

3 0 ned . 

In this application, " PLAG " will be used to indicate 
the involvement of these types of genes in various types of 
tumors, not necessarily restricted to pleomorphic adenomas 
of the salivary glands. The term refers to all members of 
35 the PLAG gene family involved in non-physiological 

proliferative growth, and in particular involved in benign 
or malignant tumors. Members of the PLAG gene family show 
homology between zinc finger domains that are typical for 
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the PLAGl gene. However, according to the invention it was 
furthermore found that even breaks outside the actual genes 
but in the vicinity thereof might result in aberrant growth. 
The term "PLAG gene" is therefore also intended to include 
5 the immediate vicinity of the gene* The skilled person will 
understand that the "immediate vicinity" should be 
understood to include the surroundings of the gene in which 
the breaks or mutations will result in the above-defined 
non-physiological proliferative growth. 
10 The term "Tumorigenesis gene" or "T-gene" will be used 

to indicate all members of this novel PLAG gene family and 
their corresponding translocation or fusion partners, like 
CTNNB1. 

The term "wildtype cell" is used to indicate the cell 
15 not harbouring an aberrant chromosome. "Wildtype" or "nor- 
mal" chromosome refers to a non-aberrant chromosome. 

The present invention provides for various diagnostic 
and therapeutic applications that are based on the informa- 
tion that may be derived from the genes. This information 
20 not only encompasses its nucleotide sequence or the amino 

acod sequence of the gene product derived from the gene, but 
also involves the levels of transcription or translation of 
the gene . 

The invention is thus two-fold. On the one hand the 
25 aberration in cell growth may be directly or indirectly 

caused by the physical breaks that occur in the gene or its 
vicinity. On the other hand the aberration in cell growth 
may be caused by a non-physiological expression level of the 
gene. This non-physiological expression level may be caused 
3 0 by the break, or may be due to another stimulus that activa- 
tes or deactivate the gene. At present, the exact mechanism 
or origin of the aberrant cell growth is not yet completely 
unraveled. However, exact knowledge at this time is not 
necessary to define methods of diagnosis or treatment. 
3 5 Diagnostic methods according to the invention are thus 

based on the fact that an aberration in a chromosome results 
in a detectable alteration in the chromosomes* appearance or 
* biochemical behaviour. A translocation, for example will 



result in a first part of the chromosome (and consequently 
of a PLAG gene) having been substituted for another (second) 
part (further referred to as "first and second substitution 
parts") . The first part will often appear someplace else on 
5 another chromosome from which the second part originates. As 
a consequence hybrids will be formed between the remaining 
parts of both (or in cases of triple translocations, even 
more) chromosomes and the substitution parts provided by 
their translocation partners. Since it has now been found 

10 that the breaks occur in a PLAG gene, this will result in 
hybrid gene products of that PLAG gene* Markers, such as 
hybridising molecules like RNA, DNA or DNA/RNA hybrids, or 
antibodies will be able to detect such hybrids, both on the 
DNA level, and on the RNA or protein level. 

15 For example, the transcript of a hybrid will still 

comprise the region provided by the remaining part of the 
gene/ chromosome but will miss the region provided by the 
substitution part that has been translocated. In the case of 
inversions, deletions and insertions the gene may be equally 

20 afflicted. 

Translocations are usually also cytogenetically detec- 
table. The other aberrations are more difficult to find 
because they are often not visible on a cytogenetical level. 
The invention now provides possibilities for diagnosing all 

2 5 these types of chromosomal aberrations. 

In translocations markers or probes based on the PLAG 
gene for the remaining and substitution parts of a chromoso- 
me in situ detect the remaining part on the original chromo- 
some but the substitution part on another, the translocation 

3 0 partner. 

In the case of inversions for example, two probes will 
hybridise at a specific distance in the wildtype gene. This 
distance might however change due to an inversion. In situ 
such inversion may thus be visualized by labeling a set of 
3 5 suitable probes with the same or different detectable mar- 
kers, such as fluorescent labels. Deletions and insertions 
may be detected in a similar manner. 
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According to the invention the above in situ applicati- 
ons can very advantageously be performed by using FISH 
techniques. The markers are e.g. two cosmids one of which 
comprises exon 1 and the upstream region of the PLAG gene, 
5 while the other comprises the last exon and its downstream 
region. Both cosmids are labeled with different fluorescent 
markers, e.g. blue and yellow. The normal chromosome will 
show a combination of both labels, thus giving a green 
signal, while the translocation is visible as a blue signal 

10 on the remaining part of one chromosome (e.g. 8) while the 
yellow signal is found on another "chromosome comprising the 
substitution part. In case the same labels are used for both 
probes, the intensity of the signal on the normal chromosome 
will be 100%, while the signal on the aberrant chromosomes 

15 is 50%. In the case of inversions one of the signals shifts 
from one place on the normal chromosome to another on the 
aberrant one. 

In the above applications a reference must be included 
for comparison. Usually only one of the two chromosomes is 

20 afflicted. It will thus be very convenient to use the normal 
chromosome as an internal reference. Furthermore it is 
important to select one of the markers on the remaining or 
unchanging part of the chromosome and the other on the 
substitution or inverted part. In the case of the PLAG1 gene 

25 of chromosome 8, breaks are usually found in the 5 • -UTR as 
is shown by the present invention. Probes based on the 5 1 -. 
-and 3'-UTR are thus very useful. As an alternative a combi- 
nation of probes based on both translocation or fusion 
partners may be used, e.g. 5 • -UTR of PLAG1 and 3 ' -UTR of 

30 CTNNB1 . 

"Probes" as used herein should be widely interpreted 
and include but are not limited to linear DNA or RNA 
strands, Yeast Artificial Chromosomes (YACs) , or circular 
DNA forms, such as plasmids, phages, cosmids etc.. 
3 5 These in situ methods may be used on metaphase and 

interphase chromosomes. 

Besides the above described in situ methods various 
diagnostic techniques may be performed on a more biochemical 
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level, for example based on alterations in the DNA, RNA or 
protein or on changes in the physiological expression level 
of the gene. 

Basis for the methods that are based on alterations in 
5 the chromosome's biochemical behavior is the fact that by 
choosing suitable probes, variations in the length or compo- 
sition in the gene, transcript or protein may be detected on 
a gel or blot. Variations in length are visible because the 
normal gene, transcript (s) or prcftein(s) will appear in 

10 another place on the gel or blot then the aberrant one(s) . 
In case of a translocation more than the normal number of 
spots will appear. 

Based on the above principles the invention thus rela- 
tes to a method of diagnosing cells having a non-physiologi- 

15 cal proliferative capacity, comprising the steps of taking a 
biopsy of the cells to be diagnosed, isolating a suitable 
PLAG gene-related macromolecule therefrom, and analysing the 
macromolecule thus obtained by comparison with a reference 
molecule originating from cells not showing a non-physiolo- 

20 gical proliferative capacity, preferably from the same 

individual. The PLAG gene-related macromolecule may thus be 
a DNA, an RNA or a protein. The PLAG gene may be either a 
member of the PLAG1 family or of the translocation partner 
gene family of which the CTNNB1 gene is the prototype. 

25 in a specific embodiment the diagnostic method of the 

invention comprises the steps of taking a biopsy of the 
cells to be diagnosed, extracting total RNA thereof, prepa- 
ring a first strand cDNA of the mRNA species in the total 
RNA extract or poly-A-selected fraction (s) thereof, which 

30 cDNA comprises a suitable tail; performing a PCR using a 
PLAG gene-specific primer and a tail-specific primer in 
order to amplify PLAG gene-specific cDNA's; separating the 
PCR products on a gel to obtain a pattern of bands; evalua- 
ting the presence of aberrant bands by comparison to wildty- 

3 5 pe bands, preferably originating from the same individual. 

As an alternative amplification may be performed by 
means of the Nucleic Acid Sequence-Based Amplification 
(NASBA) technique [Compton, J. (1991) Nucleic acid sequence- 



based amplification. Nature 350,91-92] or variations 
thereof . 

In another embodiment the method comprises the steps of 
taking a biopsy of the tumor to obtain cells to be 
5 diagnosed, isolating total protein therefrom, separating the 
total protein on a gel to obtain essentially individual 
bands, optionally transfering the bands to a Western blot, 
hybridising the bands thus obtained with antibodies directed 
against a part of the protein encoded by the remaining part 

10 of the PLAG gene and against a part of the protein encoded 
by the substitution part of the PLAG gene; visualising the 
antigen-antibody reactions and establishing the presence of 
aberrant bands by comparison with bands from wildtype 
proteins, preferably originating from the same individual. 

15 In a further embodiment the method comprises taking a 

biopsy of the tumor to obtain cells to be diagnosed; 
isolating total DNA therefrom; digesting the DNA with one or 
more so-called "rare cutter" (typically "6- or more 
cutters") restriction enzymes; separating the digest thus 

20 prepared on a gel to obtain a separation pattern; optionally 
transfering the separation pattern to a Southern blot; 
hybridising the separation pattern in the gel or on the blot 
with a set of probes under hybridising conditions; 
visualising the hybridisations and establishing the presence 

25 of aberrant bands by comparison to wildtype bands, 
preferably originating from the same individual. . 

Changes in the expression level of the gene may be 
detected by measuring mRNA levels or protein levels by means 
of a suitable probe. 

30 Diagnostic methods based on abnormal expression levels 

of the gene may comprise the steps of taking a sample of the 
tumor to obtain cells to be diagnosed; isolating mRNA 
therefrom; and establishing the presence and/or the 
(relative) quantity of mRNA transcribed from the PLAG gene 

35 of interest in comparison to a control. Establishing the 

presence or (relative) quantity of the mRNA may be achieved 
by amplifying at least part of the mRNA of the PLAG gene by 
means of RT-PCR or similar amplification techniques. In an 
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alternative embodiment the expression level may be 
established by determination of the presence or the amount 
of the gene product (e.g. protein) by means of for example 
monoclonal antibodies . 
5 The diagnostic methods of the invention may be used for 

diseases wherein cells having a non-physiological prolifera- 
tive capacity are selected from the group consisting of 
benign tumors, such as the tumors pleomorphic adenomas of 
the salivary glands, lipoblastomas, uterine leiomyomas, and 

10 other benign tumors as well as various malignant tumors, 

including but not limited to sarcomas (e.g. rhabdomyosarco- 
ma) and leukemias and lymphomas. 

As already indicated above, it has been found that in 
certain malignant tumors the expression level of the PLAG 

15 genes is increased. Another aspect of the invention thus 
relates to the implementation of the identification of the 
PLAG genes in therapy. The invention for example provides 
anti-sense molecules or expression inhibitors of the PLAG 
gene for use in the treatment of diseases involving cells 

20 having a non-physiological proliferative capacity by modula- 
ting the expression of the gene. 

The invention thus provides derivatives of the PLAG 
gene and/or its immediate environment for use in diagnosis 
and the preparation of therapeutical compositions, wherein 

25 the derivatives are selected from the group consisting of 
sense and anti-sense cDNA or fragments thereof, transcripts, 
of the gene or fragments thereof, antisense RNA, triple 
helix inducing molecule or other types of "transcription 
clamps", fragments of the gene or its complementary strand, 

30 proteins encoded by the gene or fragments thereof, protein 
nucleic acids (PNA) , antibodies directed to the gene, the 
cDNA, the transcript, the protein or the fragments thereof, 
as well as antibody fragments. Besides the use of direct 
derivatives of the genes and their surroundings (flanking 

3 5 sequences) in diagnosis and therapy, other molecules, like 
expression inhibitors or expression enhancers, may be used 
for therapeutic treatment according to the invention. . An 
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example of this type of molecules are ribozymes that destroy 
RNA molecules. 

Bsides the above-described therapeutic and diagnostic 
methods, the principles of the invention may also be used 
5 for producing a transgenic animal model for testing pharma- 
ceuticals for treatment of PLAG -re lated malignant or benign \ 
tumors. One of the examples describes the production of such 
an animal model. 

It is to be understood that the principles of the 

10 present invention are described herein for illustration 
purposes only with reference to the PLAG1 gene mapping at 
chromosome 8ql2 and the CTNNB1 gene on chromosome 3p21. 
Based on the information provided in this application the 
skilled person will be able to isolate and sequence corres- 

15 ponding genes of the gene family and apply the principles of 
this invention by using the gene and its sequence without 
departing from the scope of the general concept of this 
invention. 

The present invention will thus be further elucidated 
20 by the following examples which are in no way intended to 
limit the scope thereof, 

EXAMPLES 

2 5 EXAMPLE 1 

1. Introduction 

This example describes the isolation and analysis of 
overlapping YAC clones and the establishment of a YAC contig 
(set of • overlapping clones), which spans genomic DNA around 

3 0 the MPS locus and includes the translocation breakpoints, 

t(3;8) (p21;ql2) , of pleomorphic salivary glands (Fig. 8), 
Pleomorphic adenomas are benign epithelial tumors 
originating from the major and minor salivary glands. Cyto- 
genetic analysis has indicated that these tumors mainly 
3 5 display chromosome breakpoints in region ql2 of chromosome 
8. In previous studies we have shown that these breakpoints 
are located in the 9 cM interval between MOS / D8S285._and 
D8S260. Here we report directional chromosome walking stu- 
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dies starting from D8S260 and D8S285, resulting in, respec- 
tively, a rudimental YAC contig of 5 Mb and a highly detai- 
led YAC contig and long-range physical map of about 2 Mb. 
The latter YAC contig has at least double coverage and 
5 consists of 34 overlapping YAC clones isolated from the CEPH 
and ICRF YAC libraries. Their insert sizes were estimated by 
contour-clamped homogeneous electric field (CHEF) gel elec- 
trophoresis. Chromosomal localization of the YACs was inves- 
tigated by FISH analysis. On the basis of YAC insert and 

10 insert-end derived DNA markers and (publicly known) sequence 
tag sites (STSs) , with an average spacing of 65 kbp, as well 
as restriction enzyme analysis, a long-range physical map 
was established for this centromeric region of chromosome 
8ql2. Within the YAC contig, the relative positions of 

15 various known genes and expressed sequence tag sites was 
determined. This YAC contig constitutes the basis for the 
construction of a transcriptional map of this region. FISH 
analysis using YACs and cosmids, corresponding to STSs in 
this YAC contig, already revealed that all but two of the 

2 0 chromosome 8ql2 breakpoints in the primary tumors that were 
tested, mapped within a 3 00 kb interval between the MOS 
proto-oncogene and STS EM156. 

2. Materials and methods 
25 2.1. Pleomorphic adenomas of the salivary glands and cytoge- 
netic analysis. 

Primary pleomorphic adenomas of the salivary glands 
were obtained from patients at the time of surgery. Primary 
cultures and chromosome preparations were established and 
30 analyzed as described before (Roijer et al., 1996). The 

primary tumors with their aberrations used in this study are 
listed in Table 1. Slides for FISH were prepared from cells 
stored in fixative at -20°C. 

35 2.2. Fluorescence in situ hybridization (FISH) 

Slides for FISH were aged for 2-10 days, or were trea- 
ted with 2x SSC (pH 7.0) at 37 °C for 30 min, and subse- 
quently denatured in 70% formamide in 2x SSC (pH 7.0), at 
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75 °C for 2-6 minutes, Cosmid and YAC DNAs (1 /ig) were labe- 
led with biotin-16-dUTP (Boehringer Mannheim) by nick trans- 
lation. Labeled DNA was purified through a Sephadex G50 
column (Pharmacia) , coprecipitated with 50 mg sonicated 
5 human placenta DNA (Sigma) , and dissolved in hybridization 
solution (50% formamide, 2x SSC pH 7.0, 50 mM sodium phosp- 
hate pH 7.0, 10% dextran sulfate). Hybridization and probe 
detection were carried out as previously described (Roijer 
et al., 1996). The alpha-satellite probe D8Z2 was obtained 

10 from Oncor. Slides were examined in a Zeiss Axiophot 

epif luorescence microscope using the appropriate filter 
combinations. Fluorescence signals were digitalized, 
enhanced and analyzed using the ProbeMaster FISH image 
analysis system (Perceptive Scientific Instruments, Houston, 

15 Texas) . Color prints were produced using a Kodak XL 7700 
monochrome continuous printer. 

2.3 DNA preparations. 

Extraction of genomic DNA and the Southern analyses 

20 were performed as described previously (Schoenmakers et al., 
1994) . DNA from YACs , cosmids, PGR products and oligonucleo- 
tides was labelled using a variety of techniques. For FISH, 
cosmid clones or inter-Alu PGR products of YACs were bioti- 
nylated with biotin-ll-dUTP (Boehringer) by nick translati- 

2 5 on. YAC DNA (100 ng) was amplified by inter Alu PCR (PI: 

CTGCACTCCAGCCTGGG , P2 : TCCCAAAGTGCTGGGATTACAG) . After initi- 
al denaturation for 5 min at 94 °C, 30 amplification cycles 
were performed each consisting of denaturation for 1 min at 
94 °C, annealing for 30 sec at 37 °C and extension for 6 min 

30 at 72 °C, and with a final extension at 72 °C for 10 min. 
Amplified DNA was purified with QIAQuick PCR Purification 
kit (Qiagen) . For filter hybridizations , probes were 
radio-labelled with alpha- 32 P-dCTP using random hexamers 
(Feinberg and Vogelstein, 1984) . In case of PCR-products 

35 smaller than 200 bp in size, a similar protocol was applied, 
but specific oligonucleotides were used to prime labelling 
reactions. Oligonucleotides were labelled using 
' gamma - 32 P-ATP. 



14 

2.4 YAC library screening. 

YAC clones in this paper were isolated from the CEPH 
mark 1 (Albert sen et al. , 1990) and CEPH mark 3 (Chumakov et 
al., 1992) YAC library, made available to us by the Centre 
5 d' Etude du Polymorphisme Humain (CEPH) . YACs were isolated 
using a combination of PCR-based screening and colony hy- 
bridization analysis (Green and Olson, 1990) . Contaminating 
Candida parapsylosis , which was sometimes encountered, was 
eradicated by adding terbinafin to the growth medium (final 
10 concentration of 25 microgram/ml) . The isolated YAC clones 
were characterized by STS-content mapping, contour-clamped 
homogeneous electric field (CHEF) electrophoresis (Chu et 
al,, 1986), restriction mapping and hybridization and FISH 
analysis. 

15 

2.5 Cosmid library screening. 

Cosmid clones were isolated from an arrayed human 
chromosome 8-specific cosmid library (Wood et al. , 1992) 
obtained from Los Alamos National Laboratory (LANL) . 
20 LANL-derived cosmid clones are indicated by their microtiter 
plate addresses. Cosmid DNA was extracted using standard 
techniques involving purification over Qiagen tips (Qiagen) . 

2.6 Pulsed-field gel electrophoresis and Southern blot 
25 analysis. 

Pulsed-field gel electrophoresis and Southern blot 
analysis were performed exactly as described by Schoenmakers 
et al. (1994). Agarose plugs containing high-molecular 
weight YAC DNA (equivalent to about 1 x 108 yeast cells) 

30 were twice equilibrated in approximately 25 ml TE buffer (pH 
8.0) for 30 min at 50 °C followed by two similar rounds of 
equilibration at room temperature. Plugs were subsequently 
transferred to round-bottom 2 ml eppendorf tubes and equili- 
brated two times for 3 0 min in 500 microliter of the approp- 

35 riate 1 x restriction-buffer at the appropriate restriction 
temperature . 

Thereafter, DNA was digested in the plugs according to 
the suppliers (Boehringer) instructions for 4 h using 30 



15 

units of restriction endonuclease per digestion reaction. 
After digestion , plugs along with appropriate molecular 
weight markers were loaded onto a 1% agarose / 0.2 5 x TBE 
gel, sealed with LMP-agarose and size fractionated on a CHEF 
5 apparatus (Biorad) for 18 h at 6.0 V/cm using a pulse angle 
of 12 0 degrees and constant pulse times varying from 10 sec 
(separation up to 3 00 kbp) to 2 0 sec (separation up to 500 
kbp) . In the case of large restriction fragments, additional 
runs were performed, aiming at the separation of fragments 

10 with sizes above 500 kbp. 

Electrophoresis was performed at 14 °C in 0.25 x TBE. As 
molecular weight markers, lambda ladders (Promega) and 
home-made plugs containing lambda DNA cut with restriction 
endonuclease Hindlll were used. After electrophoresis, gels 

15 were stained with ethidium bromide, photographed, and UV 
irradiated using a stratalinker (Stratagene) set at 120 mJ. 
DNA was subsequently blotted onto Hybond N+ membranes 
(Amersham) for 4-16 h using 0.4 N NaOH as transfer buffer. 
After blotting, the membranes were dried for 15 min at 80 °C, 

20 briefly neutralised in 2 x SSPE, and prehybridised for at 
least 3 h at 42 °C in 50 ml of a solution- consisting of 50% 
formamide, 5 x SSPE, 5 x Denhardts, 0.1% SDS and 200 
microgram/ml heparin. Filters were subsequently hybridised 
for 16 h at 42 °C in 10 ml of a solution consisting of 50% 

25 formamide, 5 x SSPE, 1 x Denhardts, 0.1% SDS, 100 

microgram/ml heparin, 0.5% dextran sulphate and 2-3 x 106 
cpm/ml of labelled probe. Thereafter, membranes were first 
washed two times for 5 min in 2 x SSPE/ 0.1% SDS at room 
temperature, then for 30 min in 2 x SSPE/0.1% SDS at 42 °C 

30 and, finally, in 0.1 x SSPE/0.1% SDS for 20 min at 65°C. 
Kodak XAR-5 films were exposed at -80 °C for 3-16 h, 
depending on probe perf ormance. Intensifying screens (Kyokko 
special 500) were used. 

Agarose plugs containing high-molecular weight yeast + 

35 YAC DNA (equivalent to 1 x 10 9 cells ml" 1 ) were prepared as 
described before (Schoenmakers et al., 1994). Plugs were 
. thoroughly dialysed against four changes of 25 ml T10E1 (pH 
8.0) followed by two changes of 0.5 ml 1 x restriction 
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buffer before they were subjected to either pulsed-f ield 
restriction enzyme mapping or YAC-end rescue. 

2.7 PCR analysis. 
5 PCR amplification was carried out using a Pharmacia 

LKB-Gene ATAQ Controller (Pharmacia/LKB) or a Perkin Elmer 
9600 (Perkin-Elmer Cetus) in final volumes of respectively 
50 and 25 microliter containing 10 mM Tris-HCl pH 8.3, 50 mM 
KCl, 1.5 mM MgCl 2 , 0.01% gelatine, 2 mM dNTPs, 20 pmole of 

10 each amplimer, 1.25 units of AmpliTaq (Perkin-Elmer Cetus), 
and 100 ng (for superpools) or 20 ng (for pools) of DNA. 
After initial denaturation for 5 min at 94 °C, 30 amplifica- 
tion cycles were performed each consisting of denaturation 
for 1 min at 94 °C, annealing for 1 min at the appropriate 

15 temperature (see Table 3), and extension for 1 min at 72 °C, 
and with a final extension at 72 °C for 10 min. Results were 
evaluated by analysis of 10 microliter of the reaction 
product on 2% agarose gels. 

20 2.8 Generation of STSs from YAC inserts and insert ends. 

YAC-end rescue was performed using a vectorette-PCR 
procedure in combination with direct solid phase DNA sequen- 
cing, as described before (Geurts et al., 1994). STSs speci- 
fic for YAC inserts were generated from inter-Alu PCR pro- 

25 ducts, isolated using published oligonucleotides TC65 or 517 
(Nelson et al., 1989) to which Sail-tails were added to 
facilitate cloning. Figure 10 shows some of these STSs. 
After sequence analysis, primer pairs were developed using 
the OLIGO computer algorithm (Rychlik, 1989) . They were 

30 tested on human genomic DNA, basically according to 
procedures described above. 

2.9 Nucleotide sequence analysis and oligonucleotides. 

Nucleotide sequences were determined according to the 
35 dideoxy chain termination method using the T7 polymerase 
sequencing kit of Pharmacia/LKB or the dsDNA Cycle Sequen- 
cing System (GIBCO/BRL) . Sequencing results were analyzed 
using an A.L.F. DNA sequencer™ (Pharmacia Biotech) on 
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standard 3 0 cm, 6% Hydrolink*, Long Range™ gels (AT Bio- 
chem) . Sequence analysis utilized Lasergene (DNASTAR) and 
BLAST and BEAUTY searches (NCBI; Altschul et al., 1990). All 
oligonucleotides were purchased from Pharmacia Biotech. 

5 

3 . Results 

3.1 Isolation of YAC clones to start chromosome walking. 
According to radiation hybrid mapping data, band 8ql2 

is flanked by markers D8S165, D8S285 proximally, and by 

10 D8S260 distally (Sapru et al. , 1994). The estimated genetic 
distance between these markers is 9 cM (Gyapay et al., 
1994) . Using primer sets for these microsatellite markers, 
we selected sets of corresponding YAC clones. These YACs 
were used for initiating the walk towards the 8ql2 translo- 

15 cation breakpoints, meanwhile generating a long-range contig 
based upon STS-content mapping of these. Hence we started to 
construct a 8ql2 centromeric YAC contig (from D8S165 / 
D8S285) and a 8ql2 telomeric YAC contig (from D8S260) . By 
fluorescent in situ hybridisation (FISH) experiments, we 

20 could soon show that all but two of the translocation 

breakpoints in primary adenomas mapped within a 1 cM inter- 
val within the centromeric YAC contig (Roijer et al. , 1996). 
Hence, the YAC contig of the telomeric side of 8ql2 was not 
characterized in that much detail. The two YAC contigs could 

2 5 not be linked to each other. 

3.2 Assembly of a YAC contig starting from D8S285 

In previous studies (Roijer et al., 1996) we have 
described the isolation of two overlapping YACs containing 

30 D8S285, namely 935E9 and 166F4. These YACs, together with a 
MOS containing YAC (900gl0157) , were used as a starting 
point for a chromosome walking project to define the 8ql2 
centromeric region, as mentioned in the introduction. Initi- 
ally, chromosome walking was performed bidirectionally . This 

35 allowed us to hook up our 8ql2 centromeric contig to a 
chromosome 8qll YAC contig. Both contigs have the marker 
D8S593 in common. In the subsequent PCR based screening of 
the human CEPH YAC libraries with D8S108 and D8S166, we 



isolated MegaYAC 946B7, which together with MegaYAC 935E9, 
formed the backbone of our centromeric contig. 

In the bidirectional and subsequent unidirectional 
chromosome walking steps, the following general procedures 
5 were used. First, rescuing and sequencing the ends of YAC 
clones resulted in DNA markers characterizing their left and 
right sides (Table 2) . Based on sequence data on the ends of 
the YAC insert, as well as on inter-Alu PCR products, primer 
sets were developed for specific amplification of DNA, 

10 establishing STSs (Table 3). Their location to 8ql2-qter was 
determined by CASH as well as by FISH after corresponding 
YAC and/ or cosmid clones were isolated. With the sequenti- 
ally selected and evaluated primer sets, screening of the 
YAC and cosmid libraries was performed to isolate the buil- 

15 ding blocks for contig assembly. Hence, we established a 

8ql2 centromeric YAC contig consisting of 34 overlapping YAC 
clones, covering approximately 2 Mb of DNA (Fig. 1) . The 3 4 
YACs are between 160 kb and 1660 kb long. This YAC contig 
appeared to encompass the chromosome 8ql2 breakpoints of all 

20 but two primary adenomas studied. YAC 166F4 for instance, 
was already shown to cross the translocation breakpoint in 
two adenomas with t ( 3 ; 8 ) (p2 1 ;ql2 ) (Roijer et al. , 1996). 
Characteristics of the YACs that were used to build this 
contig are given in Table 2. 

25 The YAC contig was constructed in parallel with the 

screening data of Whitehead Institute / MIT Center for 
Genome Research available via http://www-genome.wi.mit.edu- 
/cgi-bin/contig/lookup_contig (Contigs WC8 . 7 and WC-157) . 

30 3.3 Physical mapping of polymorphic markers, ESTs and genes 
All markers and genes which were or became publicly 
available during our work were STS content mapped on our 
contig and those found positive were subsequently sublocali- 
zed by (primer) hybridization on YAC Southern blots. The 
3 5 markers that were found to reside within the centromeric 
contig presented here were D8S1069 (WI-536) , AFMB055WG9, 
D8S125, D8S108, D8S1661 (WI-5459), D8S166, D8S96, D8S1816, 
* D8S285, D8S1516 (WI-3862), D8S1828 and D8S165. Furthermore, 
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besides the MOS proto-oncogene (Testa et al., 1988), two 
other genes could be positioned in our centromeric YAC 
contig, namely the YES related proto-oncogene LYN (Corey and 
Shapiro, 1994) and the preproenkephalin gene PENK (Tomfohrde 
5 et al., 1992). Finally, our contig contains 2 expressed 
sequence tags, namely WI-7754 (D8S1930) , corresponding to 
LYN, and IB2 045, not resembling any currently known gene. 
The genes encoding the kappa opioid receptor (0PRK1) , 
(Yasuda et al., 1994) and interleukin-7 (IL7) (Brunton and 
10 Lupton, 19 90) are not contained in our YAC contig. 

3.4 Restriction map and putative CpG islands: Construction 
of a rare-cutter physical map from the 2 Mb YAC contig 
in the 8ql2 centromeric region 

15 Southern blots of total yeast plus YAC DNA, digested to 

completion with rare-cutter enzymes BssHII, Kpnl, Mlul, 
NotI, Pvul, Sail and Sfil (see Materials and Methods) and. 
separated on CHEF gels, were hybridized sequentially with 
(i) the STS used for the initial screening of the YAC in 

20 question, (ii) pYAC4 right arm sequences, (iii) pYAC4 left 
arm sequences, and (iv) a human Alu-repeat probe (BLUR-8) . 
The long-range restriction map that was obtained in this way 
was completed by probing with PCR-isolated STSs/YAC end 
probes. Restriction maps of individual YAC clones were 

25 aligned, and a consensus restriction map was established. 
The region was searched for CpG islands on the basis of the 
colocalization of sites for these rare-cutter enzymes (Fig. 
1) . 

30 3.5 FISH mapping of 8ql2 breakpoints in pleomorphic 
adenomas 

In previous studies we have mapped the t (3 ;8) (p21;ql2) 
breakpoint within a 1 Mb region flanked by MOS proximally 
and by the genetic marker D8S166 distally. One YAC within 
35 this region (166F4) was shown to span the t(3;8) breakpoint 
in two primary pleomorphic adenomas (Roijer et al. (1996)). 
We have now extended these previous observations by analysis 
of additional tumors including cases with different variant 
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translocations (Table 1) . Six of eight primary adenomas with 
8ql2 abnormalities had breakpoints mapping within YAC 166F4 
(Fig. 2 A) . Also rumors with other 8ql2 rearrangements than 
the classical t(3;8) had breakpoints within the same YAC, 
5 demonstrating that this is the major breakpoint cluster 
region in pleomorphic adenomas with 8ql2 abnormalities. 

To fine map this region we isolated cosmids from an 
arrayed human chromosome 8-specific cosmid library (obtained 
from Los Alamos National Laboratory; Wood et al. (1992)) 

10 using markers contained within YAC 166F4. Cosmid clones 

containing MPS , EM156 and CH129 were isolated and mapped by 
FISH (Fig. 1) . The breakpoints in all six cases were 
localized between cosmids CEM1 ( MPS ) en CEM23 (EM156) 
(Fig. 2B and C) . In for example CG 666, CEM1 was transposed 

15 to 8p due to a pericentric inversion of the dicentric marker 
chromosome (Fig. 2B) , while CEM23 remained in its normal 
position at 8ql2 (Fig. 2C) . In adenoma CG 682, which has an 
ins (8; 3) (ql2 ;p21 . 3pl4 . 1) , CEM23 spanned the insertion 
breakpoint (Roijer et al. (1997)). 

20 Two of the eight adenomas tested, CG 650 and CG 787, 

had breakpoints clearly proximal to MPS.. To further map 
these breakpoints we isolated two additional sets of YACs 
corresponding to CH37, D8S593 and the recently identified 
XRCC7 gene (Blunt et al. (1995)), respectively. In CG 650 

2 5 the breakpoint was found to reside between ICRF YAC 
900gl0157 and YAC 898G12 (Fig. 1) , and in CG 787 the 
breakpoint was found to reside within the XRCC7 containing 
YAC 943G4 (Fig. 2D) . In the latter case, which judged from 
cytogenetics had an ins (8 ; 7 ) (qll-12 ;p2 lpl4 ) , FISH analysis 

30 revealed signals on both the der(7) and the der(8) 
chromosomes, demonstrating that this is not a simple 
8 : 7-insertion but a complex rearrangement with at least two 
breakpoints on 8q, one of which maps within YAC 94 3G4 and 
the other proximal to this YAC. 

35 3.6 Assembly of a YAC contig between D8S260 and D8S507 
Two MegaYACs corresponding to D8S2 60, were used to 
start the chromosome walk from the telomeric side of chromo- 
some 8ql2.The YAC insert-ends from these two clones (783H12 



21 



and 814A6) were isolated and used for STS content mapping* 
The most telomeric STS (CH31) , is present in two YACs also 
containing D8S510, and hence links the 8ql2 YAC contig to a 
8ql3 specific YAC contig (Koenig et al., 1995). When we 
5 started our physical mapping effort, only one polymorphic 
marker, D8S507, was described in between D8S285 and D8S260 
(Gyapay et al., 1994). Initially, we were unable to isolate 
YACs containing D8S507. In stead we used the corresponding 
amplimers to isolate a cosmid containing D8S507 (cosmid 

10 CEM3) . An insert-ends of this cosmid "clone (EM7 3) was then 
used to isolate corresponding MegaYACs (872E1, 882D9 and 
9 57C4) . It is unclear why these YACs were initially not 
recognised upon the YAC library screenings. This might have 
been due to either a less efficient PCR of D8S507 than of 

15 EM73, or to an underrepresentation of the D8S507 YAC-contai- 
ning yeast cells in the primary pools of the specific 
library we screened. The availability of two new polymorphic 
markers, D8S1505 and D8S1515 (WI-6879) allowed to link the 
two separate MegaYAC contigs around D8S260 and D8S507. The 

2 0 composite contig consists of 2 3 CEPH MegaYACs and covers 

approximately 5 Mb of genomic DNA (Fig. 3) . The polymorphic 
markers that were tested and found to reside in the telome- 
ric YAC contig are D8S1151 (WI-1155) , D8S260, D8S1075 
(WI-943), D8S1505, D8S1515 (WI-6879), D8S1723, D8S507 and 

25 D8S1957 (WI-9507) . 

One gene, CYP7 , encoding cholesterol 7a-Hydroxylase 
(and previously assigned to 8qll-12 using both mouse-human 
somatic cell hybrids and FISH (Cohen et al., 1992)) was 
mapped in this contig using a primerset that we developed on 

30 publicly available sequence data (see Table 3) . 

The YAC contig was constructed in parallel with the scree- 
ning data of Whitehead Institute / MIT Center for Genome 
Research available via http://www-genome.wi.mit.edu/ 
cgi-bin/contig/lookup_contig (Contigs WC8.8 and WC-727). 

35 

4 . Discussion 

In this example the physical map of 2 Mb of the - 
centromeric region of chromosome 8ql2 composed of 3 4 YAC 
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clones is presented. It is tagged by 31 markers with an 
average interval of 65 kb and was validated by FISH mapping. 
The frequency of chimaerism in the set of YACs studied in 
detail (YAC end rescue and/or restriction enzyme mapping 
5 and/or FISH) was slightly lower (6/26 = 23%) than the 30-40% 
expected (Van Ommen, 1993) . These data provide the most 
detailed available physical map of the 8ql2 region. The map 
was constructed in parallel with the establishment of a 
low-resolution map of the human genome and with the 

10 screening data of the Whitehead Institute/MIT Center for 
Genome Research. It was compared with data available 
electronically via Internet. The order of the markers on the 
physical map is in agreement with the 199 6 Genethon human 
genetic linkage map (Dib et al. (1996)). In total, 

15 31 markers/genes were ordered on the contig as follows: 
D8S1069-CH274-TC65. 2-AFMB055WG9-CH277-CH34-D8S12 5 — 
D8S108-EM76-D8S1661-CH12 2-D8S96-D8S166-CH69-CH3 3-D8S1816-PE- 
NK-CH280-CH129-EM156-END2-CH283-D8S2 85-MOS-D8S1516-LYN-D8S1- 
8 2 8-IB2 04 5-D8S16 5-CH37-W1086. 

20 A restriction map with the enzymes BssHII, Kpnl, Mlul, 

NotI, Sail, Sfil, Swal was generated, revealing at least 7 
putative CpG islands. Three of these island correspond to 
previously known genes (PENK, MOS and LYN) , while the others 
might represent the 5' end of unidentified genes so far. Our 

25 2 Mb map integrates cytogenetic, genetic and physical data 
of chromosome band 8ql2. Chromosome band 8ql2 seems to be 
very gene-poor: so far only 4 genes were shown to reside in 
both contigs (CYP7 , PENK, MOS and LYN). This is consistent 
with the view that Giemsa-positive bands are chromosome 

30 regions rather low in G+C content and poor in genes (Craig 
and Bickmore, 199 3) . Chromosome 8ql2 is partially conserved 
in synteny with mouse chromosome 4 (Mock et al., 1996). 
Comparative gene mapping however did not reveal any mouse 
gene or marker not present on our physical map. 

35 Our physical map contains at least one disease locus, 

i.e, the pleomorphic adenoma translocation breakpoint. FISH 
analyses using YACs and cosmids revealed that the majority 



of breakpoints clustered within a 3 00 kb region in both 
adenomas with the recurrent t(3;8) and in those with 
different sporadic 8ql2 translocations. A cosmid from this 
region was also shown to span the breakpoint in an adenoma 
5 with an ins(8;3) (Roijer et al. (1997)). This region is 

therefore likely to harbor the pleomorphic adenoma gene. The 
breakpoint region contains one known gene ( MPS ) , one 
polymorphic marker (D8S285) , and three new STSs (EM156, END 2 
and CH283). BLAST searches of these STSs revealed that CH2 8 3 

10 displayed sequence identity with a publicly available EST 
(expressed sequence tag) . 

Two of the eight tumors in the present series had 
breakpoints outside the 3 00 kb breakpoint region. By FISH we 
could demonstrate that the breakpoints in both cases were 

15 clearly proximal to MPS . In one case the breakpoint was 

mapped between MPS and YAC 898G12, and in the other case it 
was found to reside within YAC 943G4. The latter YAC maps at 
least 2 Mb centromeric to MPS . Since the two breakpoints are 
clearly different and also separate from those located in 

20 the 300 kb breakpoint region it is unlikely that they affect 
the same gene. However, we can of course, not rule out the 
possibility that these cases have additional rearrangements 
that may have escaped detection with the DNA probes used, 
and that they in fact all have rearrangements of a common 

25 gene in the breakpoint region. 

Recently, we have obtained molecular evidence for an 
additional breakpoint region distal to our telomeric contig 
(Roijer et al. (in manuscript)). Based on FISH using YACs 
derived- from 8ql3-21 we have estimated that these 

3 0 breakpoints are about 15 to 20 cM telomeric to the 8ql2 
breakpoint cluster region. Cytogenetically , these 
breakpoints are very difficult, or impossible to distinguish 
from the classical 8ql2 breakpoints. This is also true for 
the breakpoints located centromeric to MPS . Collectively, 

3 5 our findings thus suggest that there are several genes in 
the proximal part of 8q that are affected by chromosome 
rearrangements in pleomorphic adenomas. 



In addition to pleomorphic adenomas there are also a 
few other types of solid tumor with structural 
rearrangements involving 8qll-l3, namely lipoblastomas (Dal 
Cin et al. (1994); Sawyer et al. (1994)), rhabdomyosarcomas 
5 (Mitelman (1994)) and renal cell carcinomas (Elfving 
(1996)). It should also be mentioned that pleomorphic 
adenomas of the lacrimal glands show recurrent 
rearrangements of 8ql2 (Hrynchak et al. (1994)), including a 
t (3;8) (p21;ql2) . Because of the well known histopathological 

10 and clinical similarities between pleomorphic adenomas 
originating from the salivary and lacrimal glands it is 
likely that the 8ql2 abnormalities in these tumor types 
affect the same gene. As for the other tumor types mentioned 
it remains to be shown whether the rearrangements in these 

15 cases also affect this gene. Preliminary FISH analysis of 
lipoblastoma with an 8ql2 translocation breakpoint have 
shown that the breakpoint, as in pleomorphic adenomas, is 
distal to MPS (unpublished observations) . Unfortunately no 
markers telomeric to MPS could be tested. FISH studies are 

2 0 now in progress to resolve these critical questions. 

Finally, a second YAC contig was established between 
D8S260 and D8S507, consisting of 23 CEPH MegaYACs and cove- 
ring approximately 5 Mb. Both contigs represent about 75% of 
human chromosomal band 8ql2, which extends over approximate- 
25 ly 9 Mb of DNA. They will provide molecular access to new, 
previously unidentified genes from this region and especial- 
ly to those directly affected by the chromosome 8ql2 aberra- 
tions in pleomorphic adenomas of the salivary gland. 

3 0 6_j_ Figure legends 

Figure l. 

Physical contig and STS marker map in proximal human 
8ql2. The upper line indicates the chromosome, with the 
3 5 telomere to the left and the centromere to the right. DNA 
markers and genes are ordered across the top of the figure. 
Genes are underlined and previously known markers are -indi- 
cated in bold. YAC clones are represented as lines together 
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with their names and are described in Table 2 . STS locations 
are indicated by dashed lines. The length of the solid line 
indicates its approximative size. The size of YAC 391H2 and 
392A6 is too large to fit within the constraints of the map, 
5 suggesting chimaerism. In these cases, the lenght of the YAC 
corresponds only to the markers it contains, YAC end clones 
used as STS are represented by arrows. Below the contig the 
consensus restriction maps for BssHII (B) , Kpnl (K) , Mlul 
(M) , NotI (N) , Pvul (P), Sail (S) and Sfil (Sf) are indica- 
10 ted. Verical arrows indicate putative CpG islands, defined 
as the colocalization of sites for~ (a) K, M, N; (b) K, M, P, 
S, Sf; (c) K, M, P, Sf; (d) K, N, S; (e) K, N, S; (f) B, N, 

s; (g) b, k, n, sf. 

The pleomorphic adenoma gene region is indicated by a gray 
15 shaded box. 

Figure 2 . 

Physical contig and STS marker map in distal human 
8gl2. The upper line indicates the chromosome, with the 

20 telomere to the left and the centromere to the right. DNA 
markers and genes are ordered across the top of the figure. 
Genes are underlined and previously known markers are indi- 
cated in bold. YAC clones and cosmid CEM3 are represented as 
lines together with their names. YAC end clones used as STS 

2 5 are represented by arrows. 

Figure 10. 

STSs used to generate the 300 kb cosmid contig mapping 
at chromosome 8ql2 and encompassing PLAG1 . 

30 
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TABLE 2: Analysis of YAC clones in the centromeric contig 



CEPH code 


Size ( kb) 


Landm. left Acc. No. 


Landm. right Acc. No. 


Chimeric 


518G7 




CH131 




Y (left) 


518G12 




CH132 




ND 


770B2 


1410 


CH281 


CH286 


ND 


878F2 


800 


CH282 


CH287 


ND 


255HI0 


390 


CH273 


CH272 


ND 


531D4 


430 


CH274 


CH277 


ND 


15H11 


430 






ND 


29ipi2 


410 


CH29 


CH34 


Y (left) 


47E9 


370 






ND 


26F6 


460 






ND 


916F12 


1230 


CH122 


CH128 


ND 


388D6 


430 


CH69 


CH100 


ND 


297E11 


350 


CH280 


CH285 


ND 


253 H7 


160 


CH279 


CH284 


ND 


143D5 


260 


CH278 


CH283 


ND 


391H2 


620 






Y 


392A6 


600 






Y 


84E12 


500 


CH289 


CH290 


ND 


166F4 


700 


CH67 


CH73 


ND 


164H5 


375 


END2 


END6 


ND 


206F11 




END3 


END7 


V (right) 


253C7 


620 


END4 


END8 


ND 


295A4 




CH68 


CH74 


Y (left) 


917E11 


1380 


END1 


END5 


ND 


946B7 


1370 


CHI 23 


CH129 


ND 


935E9 


1660 


CH33 


CH37 


ND 



Note. YAC clones were isolated from CEPH YAC libraries as described under Materials and 
Methods. ND. not detected by methods used (FISH, YAC end rescue, restriction enzyme 
mapping). GenBank accession numbers are given. 
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TABLE 3: 8ql2 STS primer sequences, annealing temperatures, and 
expected PCR product sizes 

Amplim ers on published sequences centromeric contig 

STS name Nucleotide sequence 5 1 - 3' Product T a (°C) 

size (bp) 



LY N GGAAGGAAAGGAAAGGAGA 194 60 

GGTTTGGGTGTTTGGTGTG 



GAGCTCAACGTAGCAAGGCT 203 60 

AACTGTCCTCCAGTGCGG 



PENK TAATAAAGGAGCCAGCTATG 75-83 58 

ACATCTGATGTAAATGCAAGT 

D8S96 TCTCTACCTCGACATACTCCTGC 113 60 

GTC A G A A GTGCC A ATC A G A CTG 

D8S108 CAAACCTTGAATTACAAAACAG 116 58 

TGTTAATATTAGACCACCTTTC 

D8S125 TTCTGTTTCCTGGTCTCAGTAGC 144 62 

CACATGCATATGTGTATGTGTGTG 

D8S165 ACAAGAGCACATTTAGTCAG 138-152 58 

AGCTTCATTTTTCCCTCTAG 

D8S166 GATTGTGTCATTGCACTCCA 116 58 

A C A A GG A A GTTCCTTTTTGG 

D8S285 GC ATC A C A CA G A ATCTTTG 108-124 60 

ATGGGTTTATGGCCTTTAC 

D8S1069 - AGCACAGTGGATATTTTTAGGC 221 60 

GGGGCTCACACAGAAGTTAA 



D8S1516 



GTCCCCCATCAACATGCTG 
CTCATCTTGTTTTCATAGTGTTCC 



174 60 



D8S1661 



TCTCATGCATGTTTCCTGTTG 126 
GTTGGGGTCATTAAACACTAGTCA 



60 



D8S1816 



TGCACCCTTAAAAAGCATCG 
ACTTGCGAACATGGGATCAC 



143-147 



60 



D8S1828 



AFMB055WG9 



AGTGCTGTTTTTACTTCTGTACG 
GCAAGACTCTGTCTCAGGA 

CTCCCAACCCACCCGAC 
TGAAAACCATAATCTCTGATGTTGC 



198-238 



218 



60 



60 



Amplimers on newly isolated STSs centromeric contig 
STS name Nucleotide sequence 5' - 3' 



Product T a (°Q 
size (bp) 



CH33 



CCTTTGGCTGGGGTTTATA 
GGCCTATGAAGCAAGAGAG 



168 



60 



CH34 



GTACCCAGAAGGCAAGTAA 
GTGAAAAGGCAGAAATTAG 



145 



60 



CH37 



TTGCATGAGAATGGAAATG 
GGCGTT A CTTT CC 11 11 GT 



176 



58 



CH69 



AGTGCTTACAATAGGGTGAG 
CCATCCAGAAAGACCATAAT 



336 



60 



CH122 



TTTGTCTTTGATTTTTATGG 
TGACCAACATACTGCCTAGT 



250 



57 



CH129 



CTGAATCCCAGAACAATATA 
AGGGTAAGTATGTCCTTTAA 



110 



60 



CH273 



30 

ATAATGTTGAGACTTTGAGA 
AAATGTTTATCCTAATTGTA 



♦ * « ♦ t 

* • • • 

' • i 

' » • r 



157 



58 



CH274 



CAGGTGAGTGGATGGTGTAA 
CAAGGGGAGACCAAATCATC 



241 



60 



CH277 



AATGGCTATGAGGTTGTTTT 
CACATCCTTTCATTTTAGCA 



122 



58 



CH280 



GGGCTGATGTTCCATTAACT 
GCTTCAACACCAAAAATGCT 



163 



58 



EM76 



CTGGGAAGAGATCAAAATTC 
TAAAGAGACAGCACCACAAA 



220 



60 



EM 156 



AGTAGCAGCAGCAACAGTCA 
TGCGCTATTCAGAGAAGATG 



160 



60 



EM216 



CAGTCAGTTCCAGAGGTCATTT 
T A GGGA GGGCTTTA ATAGTGTT 



255 



55? 



END1 



GCTCACTTCACTCCTACCC 
CAACCAACCACTAAAAACG 



161 



60 



END2 



GTG ATTTTA C A GCC ATTTT 
TGTAA I _ i _ 1 _ 1 CAACCAGAAG 



91 



50 



TC65.2 



TACAAACCGGGAGAAAACAG 
TTACAGCATTTCCGATTTTG 



232 



58 



Amplimers on published sequences centromeric contis 



STS name 



Nucleotide sequence 5 1 - 3 ! 



Product T a (°C) 
size (bp) 



CYP7 



TTGA C [•['[[ A AATATG AT AGGTAT 
ACT1T1 ' ATTTCTG A A AGATGA ATCA 



1600 



55 



D8S260 



AGGCTTGCCAGATAAGGTTG 
GCTGAAGGCTGTTCTATGGA 



187-213 



60 



DSS507 



TTCCTCAGAGCAGTTCAAAG 
TAATCTTGCCCCAGTGAGAG 



155-169 



60 



D8S1113 



ATGCAAAGATGAACCAGGAA 
CCCTGGACTCATGGTACTTG 



216 



62 



D8S1505 



GGATTTTAAGTTTCTACAAAGGGA 328 
ACAATTTCCATGAAGTGTTCACC 



58 



D8S1723 



AGCTCAATGGCACGTCCTTT 
ACTGCTGACTCAGAGCCTGG 



235-243 



60 



Amplimers on newly isolated STSs telomeric contig 



STS name 



Nucleotide sequence 5' - 3' 



Product T a (°Q 
size (bp) 



CH31 TCACAGAATAATACAGGAT 220 58 

GATCACTGATGATACTAGG 

CH32 ATTTGCCTCAGTGTTGCAG 245 62 

ATTTTCAGGAGGTCAGGGA 



CH35 



CAAAATGACTTATGCTGAA 
TCTATACAGGGCATTGTGA 



165 60 



EM73 



AAAGCAAGACCCTGTAAAGC 
CTTGGGCTCTATTTTGTGAA 



351 



60 



EXAMPLE 2 

5 1. Introduction 

In this example the identification of a member of the 
PLAG gene family that appears to be of pathogenetical rele- 
vance is described. In previous experiments, it was found 
that the segment between MPS and PENK on human chromosome 
10 8ql2 harbors the recurrent chromosome t(3;8) (p21;ql2) 

breakpoint frequently found in pleomorphic adenomas of the 
salivary glands. Using a positional cloning approach, the 
PLAG1 gene was identified and its genomic organization 
characterized . 

15 The largest cytogenetic subgroup of pleomorphic adenoma 

of the salivary glands carries chromosome 8ql2 aberrations 
with 3p21 as preferential translocation partner. Here we 
demonstrate that the t(3;8) (p21;q!2) results in promoter 
swapping between PLAG1 , a novel, developmentally regulated 

20 zinc finger gene on 8ql2, and the constitutively expressed 
gene for 6-catenin (CTNNB1) , a protein interface functioning 
in adherens junctions and the WG/WNT signalling pathway. 
Fusions occur in the 5 1 -non-coding parts of both genes, 
exchanging regulatory control elements while preserving the 

25 coding regions. Due to the t ( 3 ; 8 ) (p2 1 ;q!2 ) , PLAG1 is activa- 
ted and expression of CTNNB1 down-regulated. Activation of 
PLAG1 was also observed in an adenoma with a variant trans- 
location t(8;15). The results indicate that PLAG1 activation 
due to promoter swapping is a crucial event in salivary 

30 gland tumor igenesis . 

Recently, molecular insight was obtained into the 
genetic basis of benign tumors by the discovery of a common 
genetic denominator in tumors of different types but with a 
particular cytogenetic profile (Schoenmakers et al. (1995) ; 

35 Ashar et al. (1995)). Molecular analysis of recurrent 

chromosome 12ql3-15 aberrations in pleomorphic adenoma of 
the salivary glands, lipoma, uterine leiomyoma, hamartoma of 
lung and breast, fibroadenoma of the breast, angiomyxoma, 
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and endometrial polyps revealed the consistent involvement 
of the high mobility group protein gene HMGIC; the first 
"benign oncogene" . 

Evidence is mounting that the same is likely to hold 
5 true for the closely related family member HMGIY , which is 
located on the short arm of chromosome 6. Both genes belong 
to the high mobility group (HMG) protein gene family (Bustin 
et al. (1990). Their corresponding proteins possess three 
amino-terminal AT hooks, through which the proteins are 

10 assumed to bind to A/T-rich DNA sequences, and an acidic 

tail in the carboxy-terminal region. Functionally, they act 
as architectural factors in the nuclear scaffold, critical 
for the correct assembly of stereospecif ic transcriptional 
complexes (Wolffe (1994) ) . 

15 The genetic aberrations frequently result in the 

separation of the three AT hooks from the acidic tail, 
although all the pivotal changes in the high mobility group 
protein genes that can lead to aberrant growth control are 
not yet known. It should be noted that a variety of chromo- 

20 some segments can act as translocation partner of 12ql3-15 
but each tumor type seems to have a preferential transloca- 
tion partner. At present, only the preferential t(3;12) (q27- 
28;ql5) translocation of ordinary lipoma has been 
characterized (Petit et al. (1996)). The gene affected on 

2 5 chromosome 3 is the LPP gene, which encodes a protein 

containing three LIM domains. LIM domains have been found in 
a growing number of proteins in mammals, amphibians, flies, 
worms, and plants and act as modular protein-binding 
interfaces (Schmeichel & Beckerle (1994)). LIM proteins 

30 mainly have a function in cell signalling and developmental 
regulation and include, for instance, transcription 
regulators, proto-oncogene products, and adhesion plaque 
constituents. The preferential t(3;12) in lipoma results in 
the formation of an HMGIC/LPP fusion transcript encoding a 

35 hybrid protein consisting of the three DNA binding domains 
of HMGI-C and the LIM domains of the protein encoded by LPP . 

In an approach to systematically unravel the molecular 
pathogenesis of benign tumors, the inventors have focused on 
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other cytogenetic subgroups of these, since their mere 
existence indicate the presence of additional critical 
genes. It is of interest to establish whether such genes are 
functionally similar or different as compared to the high 
5 mobility group protein genes HMGIC and HMGIY . In this con- 
text, the inventors have now molecular ly characterized the 
largest cytogenetic subgroup of pleomorphic adenomas. This 
type of neoplasm is an epithelial tumor occurring primarily 
in the major and minor salivary glands. It is by far the 

10 most common type of salivary gland tumor, accounting for 
almost 50 % of all tumors in these organs (Seifert et al. 
(1990)). Pleomorphic adenomas are almost exclusively benign 
tumors, which only rarely undergo malignant transformation. 
They show a marked histological diversity with epithelial 

15 and myoepithelial cells arranged in a variety of patterns in 
a matrix of mucoid, myxoid, chondroid and, on rare 
occasions, even osteoid tissues. Cytogenetically , 
pleomorphic adenomas are characterized by recurrent 
rearrangements, in particular reciprocal translocations with 

20 consistent breakpoints at 3p21, 8ql2 and 12ql3-15 (Mitelman 
(1994)). In addition to the cases with abnormal karyotypes, 
there is also a subgroup of tumors with apparently normal 
karyotypes (30 to 50% of the cases) . Abnormalities of 8ql2 
are most common and are found in about 60% of the cases with 

2 5 abnormal stemlines, whereas rearrangements of 3p21 and 

12ql3-15 are found in about 30% and 20% of the cases, 
respectively. The most frequent and characteristic 
abnormality so far observed in pleomorphic adenomas is a 
reciprocal t (3 ; 8 ) (p2 1 ;ql2 ) translocation (Mark et al. 
30 (1980)). While chromosome 3p is the preferential 

translocation partner, most human chromosomes have been 
found as translocation partner of 8ql2. Rearrangements of 
8ql2 are often found as the sole anomalies, indicating that 
they represent primary cytogenetic events of possible 

3 5 pathogenetic importance. 

Here, the inventors describe the positional cloning of 
the 8ql2 translocation breakpoint as well as the identifica- 
* tion and characterization of the genes on chromosome 8ql2 
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and 3p21 disrupted by the t(3;8) translocat ion. The. gene on 
chromosome 8ql2 is a novel zinc finger gene, which we have 
designated PLAG1, whereas the gene on 3p21 is CTNNB1 which 
is the translocation partner, the gene for B-catenin, a 
5 protein with an established role in cell adhesion and signal 
transduction (Peifer (1993)), By Southern blot and RACE 
analysis, we pinpoint the breakpoints within the introns of 
the 5'-noncoding regions of both genes and show that well- 
defined fusion transcripts are expressed in the tumor cells. 

10 We also demonstrate that the t(3;8) translocation results in 
specific disruption of the transcriptional control of the 
two genes, leading to activation of PLAG1 and down- 
regulation of CTNNB1. A pleomorphic adenoma with 8ql2 but 
not 3p21 involvement has also been studied to evaluate the 

15 phenomenon of disruption of transcriptional control in a 
broader perspective. 

2 > Materials and methods 

2.1 Tumor material and chromosome analysis 

2 0 Primary pleomorphic adenomas of the salivary glands 

were obtained from patients at the time of surgery. Primary 
cultures and chromosome preparations were made and analyzed 
as described by Roijer et al. (1996). Primary pleomorphic 
adenomas, including CG368, CG580, CG644, CG752, CG753 and 

25 T9587, which all carry the recurrent t ( 3 ; 8) (p2 1 ;ql2 ) as the 
sole anomaly, were selected for molecular analysis, as well 
as CG682, showing an ins (8 ; 3 ) (ql2 ;p2i . 3pl4 . 1) , and CG588 
which carries a t (8 ; 15) (ql2 ;ql4) . CG tumors are obtained 
from Department of Pathology, Goteborg University, 

30 Sahlgrenska University Hospital, S-41345 Goteborg, Sweden. 
The tumor identified as T9587 is obtained from University 
Hospital Ghent, Belgium. FISH analysis was performed as 
previously described (Roijer et al. (1996)). Slides were 
examined in a Zeiss Axiophot epif luorescence microscope 

35 using the appropriate filter combinations. Fluorescence 
signals were digitalized, enhanced and analyzed using the 
ProbeMaster FISH image analysis system (Perceptive 
Scientific Instruments, Houston, Texas). Color prints were 
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produced using a Kodak XL 7700 monochrome continuous 
printer. 



2.2 Preparation and analysis of DNA and RNA 

5 Extraction of genomic DNA and Southern blot analysis 

were performed as described previously (Schoenmakers et al. 
(1994))* Total RNA was extracted from biopsy samples using 
the TRIZOL (Gibco/BRL) method and Northern analysis was 
performed according to standard procedures (Sambrook et al. 

10 (1989)). DNA from YACs, cosmids, PCR'products, and 

oligonucleotides was labelled using a variety of techniques. 
For FISH, cosmid clones or inter-Alu PCR products of YACs 
were biotinylated with biotin-ll-dUTP (Boehringer) by nick 
translation. For filter hybridizations, probes were radio- 

15 labelled with a- 32 P-dCTP using random hexamers (Feinberg & 
Vogelstein (1984)). In case of PCR-products of T-genes of 
the invention smaller than 200 bp in size, a similar 
protocol was applied, but T-gene specific oligonucleotides 
were used to prime labelling reactions. Oligonucleotides 

2 0 were labelled using y- Z2 P-ATP . 

2.3 YAC, cosmid, phage, and cDNA libraries 

YAC clones in this paper were isolated from the CEPH 
mark 1 YAC library (Albertsen et al. (1990)), using a 

2 5 combination of PCR-based screening and colony hybridization 
analysis (Green & Olson (1990)). YAC DNA was isolated and 
characterized as described before (Schoenmakers et 
al. (1995) ; Schoenmakers et al. (1994)). Cosmid clones were 
isolated from an arrayed human chromosome 8-specif ic cosmid 

30 library (Wood et al. (1992)) obtained from Los Alamos 

National Laboratory (LANL) . LANL-derived cosmid clones are 
indicated by their unique microtiter plate addresses. Phage 
clones were derived from a genomic library constructed with 
Li-14/SV40 DNA (Schoenmakers et al. (1994)) in XFIXII 

35 according to standard procedures. Cosmid and phage DNA was 
extracted using standard techniques involving purification 
over Qiagen tips (Qiagen) . Positive cDNA clones were - 
identified in a mixed poly-dT/random-primed library in Agtll 
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constructed from human fetal kidney (Clontech) . Library 
screening was performed by plaque hybridization using 
various DNA probes, derived from subsequently isolated cDNA 
clones, according to the manufacturers instructions. 

5 

2.4 DNA sequencing and computer analysis 

Nucleotide sequences were determined according to the 
dideoxy chain termination method using the T7 polymerase 
sequencing kit of Pharmacia/LKB or the dsDNA Cycle Sequen- 
10 cing System (GIBCO/BRL) . Sequencing results were analyzed 
using an A.L.F. DNA sequencer™ (Pharmacia Biotech) on 
standard 30 cm, 6% Hydrolink*, Long Range™ gels (AT Bio- 
chem) . Sequence analysis utilized Lasergene (DNASTAR) and 
BLAST and BEAUTY searches (NCBI) . 

15 

2.5 PCR amplification of genomic DNA 

PCR amplifications were carried out essentially as 
described before (Schoenmakers (1994)). The following 
amplimers were used to generate a PLAG1 exon 1 probe (5 f -CAA 

20 TGG CTG CTG GAA AGA GG-3 1 and 5 1 -CCC GTC CGC CGC CTC TAG 
ACC-3'), a PLAG1 ORF probe (5'-CGT AAG CGT GGT GAA ACC AAA 
C-3 ' and AGG GTC GTG TGT ATG GAG GTG A-3 ' ) , a PLAG1 3 1 -UTR 
probe (5' -ACA TGG CAT TTC GTG TCA CT-3 1 and 5 • -CCA CAA TGG 
CTC TAG AT-3') and a CTNNB1 exon 1 probe (5* -TGT GGC AGC AGC 

25 GTT GGC CCG GC-3 1 and 5 • -CTC AGG GGA ACA GGC TCC TC-3 ' ) . 

2.6 Rapid amplification of cDNA ends (RACE) 

Rapid amplification of 3' cDNA-ends (3 , -RACE) was 
performed using a slight modification of part of the 

30 GIBCO/BRL 3 • -ET protocol. For first strand cDNA synthesis, 
adapter primer (AP2) AAG GAT CCG TCG ACA TC(T)17 was used. 
For both initial and secondary rounds of PCR, the universal 
amplification primer (UAP2) CUA CUA CUA CUA AAG GAT CCG TCG 
ACA TC was used as "reversed primer". In the first PCR round 

3 5 the following specific "forward primers" were used: i) 5 1 - 
CAA TGG CTG CTG GAA AGA GG-3 • (exon 1) or ii) 5 • -AGA ATT TGG 
GCC TCA GAC AAG ATA-3 ■ (3' -UTR, exon. 5). In the second- PCR 
round the following specific forward primers (nested primers 
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as compared to those used in the first round) were used: i) 
5 '-CAU CAU CAU CAU GGC CGG AGG GAG GAT GTT AA-3 1 (exon 1) or 
ii) 5* -CAU CAU CAU CAU ATT GTC CTG GGT TGA TTA TGC AT-3 • 
(3'-UTR, exon 5). CUA/CAU-tailing of the nested, specific 
5 primers allowed the use of the directional CloneAmp cloning 
system (GIBCO/BRL) • 

For 5' -RACE experiments, the Marathon cDNA Amplificati- 
on kit (Clontech) was used according the manufacturer's 
instructions with minor modifications. The 5 1 -untranslated 

10 end of the normal PLAG1 transcript as well as the chimeric 
transcripts were isolated by 5' -RACE, First strand placenta 
or adenoma cDMA respectively, was synthesized from 5 /ig 
total RNA using the MV2 primer (5" -CTG CAC TTG ACC CAC CCC 
TTG GAT— 3 * ) located in exon 5. The ds cDNA was ligated to 

15 the adaptor and amplified using the anchor primer API and 
the MVS primer (5*-CAG GAG AAT GAG TAG CCA TGT GC-3 1 ) also 
located in exon 5. A second round of PCR was performed using 
the anchor primer and the MV6 primer (5'-TGC ACT TGT AGG GCC 
TCT CTC CTG-3 1 ) located in exon 4. The final PCR products 

2 0 were purified out of agarose gel and cloned into the pCRII 
vector (Invitrogen) . 

2 . 7 RT-PCR 

Total RNA (5 jig) was reverse-transcribed using Super- 
25 script II reverse transcriptase (GIBCO BRL) and oligo d(T) 
primers according to the recommended conditions. 0.2 5 /ug of 
the resulting cDNA was subject to amplification using a 
variety of primer sets. The amplification conditions for the 
CTNNB 1 / PLAG 1 fusion transcripts were 30 cycles at 94 °C for 
30 10 sec and 68 °C for 1 min in a final volume of 50 Ml using 
the Expand long template PCR system (Boehringer Mannheim) . 
The first round PCR was carried out with the CTNNB 1 primer 
5' -TGT GGC AGC AGC GTT GGC CCG-3 ' (CAT-UP) and the PLAG1 
primer 5'-CAG GAG AAT GAG TAG CCA TGT GC-3 1 (MVS) . The 
35 second round was performed on a 20 fold diluted sample with 
the CTNNB 1 primer 5 ' -ACG GAG GAA GGT CTG AGG AGC AG-3 • 
(NECAT-UP) and the PLAG1 primer 5 1 -TGC ACT TGT AGG GCC TCT 
CTC CTG-3' (MV6) . To amplify the reciprocal FLAG1/ CTNNB 1 
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fusion transcript two rounds of PCR amplification were 
performed with 3 0 cycles at 94 °C for 3 0 sec, 63 °C for 3 0 
sec and 72 °C for 1 min in a final volume of 50 /il. The 
first round was carried out with the PLAG1 primer 5 # -CAA TGG 
5 CTG CTG GAA AGA GG-3 • (START-UP) and the CTNNB1 primer 5'- 
AAG GAG CTG TGG TAG TGG CAC-3 • (CAT3) . The second round was 
performed on a 20 fold diluted sample with the PLAGl primer 
S'-GGC CGG AGG GAG GAT GTT AA-3 ' (START-RACE) and the CTNNB1 
-primer 5 • -GCC GCT TTT CTG TCT GGT TCC A-3 1 (CAT3NEST) . 

10 

3 . Results 

3 . 1 The chromosome 8ql2 breakpoint cluster region in pleo- 
morphic adenoma 

As part of a positional cloning effort to isolate the 

15 gene(s) affected by the t (3;8) (p21;ql2) in pleomorphic 

adenomas, we constructed a yeast artificial chromosome (YAC) 
contig consisting of 34 overlapping YACs, spanning about 2 
Mb within band 8ql2 (to be published elsewhere). A long 
range STS and rare cutter physical map of this region was 

2 0 also developed. In previous experiments the t(3;8) break- 
point was mapped to a 1 Mb region flanked by MPS as proximal 
and by D8S166 as distal marker. One YAC within this region 
was shown to span the t(3;*8) breakpoint in two tumors (CG588 
and CG644) . To further narrow down the breakpoint region we 

25 isolated cosmids corresponding to landmarks within this and 
other YACs in our contig (Fig. 3A) . By FISH, we could 
demonstrate that the breakpoints in four out of four adeno- 
mas tested (CG580, CG588, CG644 and CG682) clustered within "~ 
a 300 kb subregion located between MPS and STS CH129 (Fig. 

30 3A) . Two cosmids from this region, CEM23 and CEM48, were 
shown to span the breakpoints in adenomas CG644 (Fig. 3B) 
and CG682 (data not shown) , respectively. To refine the 
distribution of breakpoints within this 300 kb subregion, we 
developed a contig consisting of 29 phage and cosmid clones 

35 (Fig. 3A) . 

3.2 Identification and characterisation of the PLAGl gene 
in the 8ql2 breakpoint region 
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While initiating experiments to identify candidate 
genes mapping within the 300 kb genomic segment harbouring 
the adenoma breakpoints, BLAST searches (Altschul et al. 
(1990)) of newly obtained STSs mapping within the 300 kb 
5 contig revealed that the right end (CH283) of YAC clone 

14 3D5 displayed sequence identity with a publicly available 
EST (expressed sequence tag) (GenBank Accession number 
D59273) . Its position in the contig raised the possibility 
that this EST belonged to the candidate pleomorphic adenoma 

10 gene f PIAG1 ) , we were searching for. 

In initial Northern blot experiments using a PCR probe 
corresponding to this particular YAC end, a 7.5 kb 
transcript was detected. A combination of sequential scree- 
nings of a human fetal kidney cDNA library as well as 5'- 

15 and 3 • -RACE experiments led to the isolation of a composite 
cDNA of 7313 nucleotides (GenBank accession number U65002) . 
This cDNA contains an open reading frame (ORF) of 1500 bp 
starting with the ATG at position 481-483 (Fig. 4A) . An in 
frame stop codon (TAG) is present 9 nucleotides upstream of 

2 0 this ATG. The deduced amino acid sequence reveals seven 
canonical C2H2 zinc finger domains (Fig.. 4B) and a non- 
finger region of 2 59 amino acid residues representing the 
carboxy-terminus of the deduced protein. The zinc finger 
motifs including the linker sequences are between 28 and 35 

25 amino acids long. The cysteine (C) and histidine (H) residu- 
es are present in their characteristic positions in each 
finger. The typical phenylalanine (F) and leucine (L) resi- 
dues in finger 4 and 6 are lacking. Strictly spoken, the 
deduced* protein is not a Kruppel zinc finger protein, since 

30 it does not contain the characteristic H/C linker (consensus 
sequence TGEKPYK) in between the zinc fingers (Bellefroid et 
al. (1989)). Only the seven amino acids between finger 1 and 
2 resemble the H/C linker (TGERPYK) . The amino-terminal 
region contains two nuclear localization signals (KRKR and 

35 KPRK) . The carboxy-terminus is serine-rich (45 amino acid 
residues out of 259, i.e 17%), raising the possibility of a 
regulatory role that may be controlled by serine/threonine 
kinases. 
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Outside the ORF, some interesting features in the cDNA 
can be observed. In the 3 1 untranslated region (UTR) , which 
is 53 33 bp long, a polyadenylation signal is present star- 
ting at position 7297, and there is also a TG-repeat, which 
5 might be of regulatory relevance. In addition, there is an 

ATTTA sequence motif, which has previously been shown to \ 
mediate mRNA destabilization in certain lymphokine and 
immediate early genes (Sachs (1993)). The ATTTA motif is 
repeated twelve times in the 3 , -UTR. 

10 No sequence similarity was found to any of the known 

zinc finger proteins but preliminary studies have indicated 
that the human genome contains at least one additional gene 
that is closely related to our candidate pleomorphic adenoma 
gene. Nine anonymous ESTs (expressed sequence tags) with 

15 sequence similarity to sequences of our new gene were found 
in public databases and these ESTs could be aligned resul- 
ting in a contiguous sequence of about 1 kb. An open reading 
frame appeared to be present with coding potential for 
protein sequences with zinc-finger-like features. Sequence 

20 identity between the aligned ESTs and our new gene appeared 
to be about 75% at the amino acid sequence level; i.e. in 
the region of our gene encoding zinc fingers 4 to 7 . Of 
interest to note furthermore is that these anonymous ESTs 
sequences have been mapped to chromosome 6q2 4, which is 

25 implicated in tumors of the salivary glands. 

3.3 Genomic organization of the PLAG1 gene 

Comparison of transcribed and genomic DNA sequences of 
the PLAG1 gene revealed that it contains 5 exons (Fig. 3A) . 

30 The transcriptional orientation of the gene is directed 

towards the centromere. The first three exons are noncoding, 
the fourth exon contains the translation start site (ATG) , 
and the amino-terminus of the protein including one complete 
zinc finger domain. The second finger is split by intron 4 

35 and continues into exon 5, which contains the remaining part 
of the ORF and the long 3 ' -UTR (5533 bp). Based on our YAC 
and cosmid maps, the PLAG1 locus spans about 3 5 kb, with a 
large intron (approximately 25 kb) between exon 1 and exon 
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2. 3' -and S^RACE analysis revealed the existence of two 
alternatively spliced mRNAs which differ from each other by 
the presence or absence of the noncoding exon 2, The diffe- 
rence in electrophoretic mobility of the two mRNA isoforms 
5 is too small to distinguish them by Northern blot analysis. 
The functional relevance of this alternative splicing re- 
mains to be elucidated. 

3.4 Rearrangements of PLAG1 in primary pleomorphic adenoma 
10 In an attempt to establish whether the 8ql2 aberrations 

in primary pleomorphic adenomas with t (3 ;8) (p21;ql2) led to 
rearrangements in PLAGl, Southern blot experiments were 
performed using various PLAGl-derived probes. With a probe 
within the ORF (STS probe EM265) or the 3 , -UTR (KK64) of 
15 PLAGl e no rearrangements were detected. However, using 
probes corresponding to the 5* noncoding region of PLAGl . 
i.e. intron 1 (STS probes EM416 or EM317) or exon 2 (STS 
probe EM44 0) , rearrangements were detected in each of the 
t(3;8) tumors tested (data not shown). Similar Southern blot 

2 0 analysis of primary pleomorphic adenoma CG580, which carries 

a t (8;15) (ql2;ql4) , revealed also a rearrangement in the 5 1 
noncoding region of PLAGl (Fig. 5) . These results seem to 
directly implicate PLAGl as a critical gene in pleomorphic 
adenoma tumor igenes is . Furthermore, the mechanism might 
25 involve transcriptional deregulation of PLAGl , since the 
rearrangements in PLAGl were invariably found in the 5' - 
noncoding region of PLAGl leaving its coding region intact. 

3.5 t(3;8) results in promoter swapping between PLAGl and 
30 the gene for 6-catenin, CTNNB1 

The observation that the 8ql2 translocation breakpoints 
in various primary tumors were mapped to intron 1 or intron 
2 of the 5' -noncoding region of PLAGl raised the possibility 
that the t(3;8) results in the production of a chimeric 

3 5 transcript consisting of PLAGl sequences fused to those of a 

gene located on chromosome 3p21. To test this possibility, 
5' -RACE experiments were performed using total RNA of -the 
primary tumors CG644 and CG682, which both carry a 3; 8- 
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rearrangement. We used a primer specific for exon 5 (MV2) in 
the cDNA synthesis. Following ligation of an adaptor to the 
cDNA, PCR amplification was performed using, in the first 
round, an adaptor-specific primer and a PLAGl-specif ic 
5 primer corresponding to sequences of exon 5 (MVS) and, in a 
second round, nested adaptor-specific primer and PLAGl- 
specif ic primer corresponding to sequences of exon 4 (MV6) . 
Nucleotide sequence analysis of the PCR product revealed 
that the ectopic sequences were fused to the acceptor splice 

10 site of exon 3 of PLAG1. BLAST analysis revealed that they 
were identical to exon 1 sequences of CTNNB1 (Nollet et al. 
(1996)), the gene for B-catenin, which has previously been 
assigned to chromosome 3p21 (Kraus et al. (1994)). 

Independent evidence for the involvement of CTNNB1 in 

15 the t (3 ; 8) -rearrangements in pleomorphic adenomas was obtai- 
ned by FISH using two YACs (756G5 and 750D3) , which contain 
the complete gene for B-catenin. According to published maps 
YAC 7 56G5 is contained within the proximal part of mega-YAC 
750D3 (Gemmill et al. (1995)). As expected, both YACs 

20 spanned the 3p21 breakpoint in adenoma CG682, with 

hybridization signals on the normal 3, the der(3) and the 
der(8) (data not shown). Detailed analysis of a few 
prometaphase cells enabled us to sublocalize the B-catenin 
locus to band 3p21.3, which is in accordance with previous 

25 mapping data. 

To extend our observation to a larger number of tumors, 
we applied a reverse transcription-polymerase chain reaction 
(RT-PCR) approach. RT-PCR amplifications using primers 
specific for CTKNB1 and PLAG1 were carried out with RNA from 

30 tumors CG368, CG588, CG644, CG752 , CG753 and T9587, which 
all carry the recurrent t(3;8), and from tumor CG682, which 
carries an ins ( 8 ; 3 ) (ql2 ; p21 . 3pl4 . 1) . RNA from tumor CG580, 
which carries a t(8;15), was included as a negative control. 
PCR experiments resulted in the generation of PCR products 

3 5 corresponding to hybrid transcripts consisting of PLAG1 and 
CTNNB1 sequences, in seven out of seven t(3;8) tumors analy- 
zed (Fig. 6). In tumors CG368, CG588, CG682, CG752, and 
T9587, PCR products of 509 bp (Fig. 6A, PCR product A) and 



614 bp (Fig. 6A, PCR product B) were generated, whereas in 
tumors CG644 and CG753, only the PCR product of 509 bp was 
found. The PCR product of 509 bp (from NECAT-UP up to MV6) 
is corresponds to a hybrid transcript containing exon 1 of 
5 CTNNB1 and exons 3 to 5 of PLAG1 . The PCR product of 605 bp 
contains an extra 105 bp, which corresponds to the alterna- 
tively spliced exon 2 of PLAG1. It points towards the pre- 
sence of a related isoform consisting of exon 1 of CTNNB1 
and exons 2 to 5 of PLAG1 . This was also confirmed by nucle- 

10 otide sequence analysis of the PCR products. We were also 

able to demonstrate that the corresponding reciprocal fusion 
transcripts are expressed. In all tumors except CG682, a PCR 
product of 13 0 bp was generated corresponding to a fusion 
transcript consisting of exon 1 of PLAG1 and exons 2 to 16 

15 of CTNNB1 . In tumor CG753, an additional PCR product was 

detected, corresponding to a fusion transcript consisting of 
exons 1 to 2 of PLAG1 and exons 2 to 16 of CTNNB1. This 
additional band was also observed but with weak intensity in 
tumor CG644. All these results indicate that in CG644 and 

20 CG753, the 8ql2 translocation breakpoints are located in 
intron 2, whereas the breakpoints of tumors CG3 68, CG588, 
CG682, CG752, and T9587 are located in intron 1. Interestin- 
gly, using 5 1 RACE analysis with the tumor CG580, which 
carries a t(8;15) translocation, we found that the break- 

25 point occurs also in the same region leading to a fusion 
transcript with ectopic fused to exon 3 of PLAG1 . 

3.6 Activation of PLAG1 expression due to promoter swapping 
Northern blot analysis was performed to evaluate the 

30 effect of the t (3;8) (p21;ql2) translocation on the expressi- 
on levels of PLAG1 and CTKNB1 (Fig. 7) . As previously menti- 
oned, FLAG I is expressed as a 7 . 5 kb transcript, which was 
readily detected in human fetal lung, liver, and kidney but 
not in fetal brain. In adult tissues, the 7 . 5 kb transcript 

35 was not detected in human heart, brain, lung, liver, skele- 
tal muscle, kidney, pancreas, and salivary gland. Low levels 
of PLAG1 expression was found in human placenta (Fig. _7A) . 
In contrast, the CTNNB1 gene was ubiquitously expressed as a 



3.8 kb RNA doublet in all tissues tested (Fig, 7B) . Similar 
results were obtained in the Northern evaluation of fetal 
and adult mouse tissues (data not shown) . 

Northern blot analysis was performed on two pleomorphic 

5 adenomas with t(3;8) and on one tumor with a variant t(8;15) 
(Fig. 7C) . In contrast to normal salivary gland tissue in 
which no PIAG1 transcripts could be detected, the three 
tumors expressed a 7.5 kb transcript. This transcript does 
not correspond to the normal PLAG1 transcript (exons 1 to 

0 5) , since an exon 1-specif ic probe failed to hybridize to 
this transcript (data not shown) . Instead, a probe specific 
for exon 1 of CTNNBl recognized this 7.5 kb transcript as 
well as the 3.8 kb doublet of CTNNBl transcripts in the two 
tumors with a t(3;8). In contrast, only the 3 . 8 kb doublet 

5 of CTNNBl transcripts was detected in adenoma CG580 with a 
variant t(8;15). The expression level of CTNNBl in this 
tumor was about twofold higher as compared to tumors CG644 
and CG682, which is expected since none of the CTNNBl alle- 
les is rearranged. Collectively, these results suggest that 

0 PLAG1 expression in these pleomorphic adenomas is driven by 
the 5' regulatory sequences either of the CTNNBl gene for 
the cases with t(3;8) or of an as yet unknown gene on chro- 
mosome 15 for the case with t(8;15). These results indicate 
that the principal mechanism in pleomorphic adenomas with 

5 8ql2 rearrangements is activation of PLAG1 expression, due 
to promoter swapping between PLAG1 and a translocation . 
partner gene, preferentially CTNNBl . Furthermore, expression 
of the translocation partner gene is likely to be down- 
regulated concomitantly; e.g. as demonstrated for CTNNBl . 

0 

4^_ Discussion 

We have characterized a novel gene, PLAG1, which we 
propose is a critical locus involved in the development of 
pleomorphic adenoma of the salivary glands. The gene was 
5 identified on the long arm of chromosome 8, close to the MPS 
proto-oncogene, as a result of a positional cloning project 
to molecularly define the t ( 3 ; 8) (p21 ;ql2 ) , which is the most 
frequent chromosome aberration in pleomorphic adenomas. We 
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have also identified the translocation partner gene at 3p21, 
i.e. the CTNNB1 gene which encodes B-catenin. In addition to 
8ql2 and 3p21, chromosome rearrangements involving 12ql3-15 
are also frequently observed in pleomorphic adenomas. The 
5 high mobility group protein gene HMGIC was recently identi- 
fied as the gene consistently rearranged in these cases 
(Schoenmakers et al. (1995)). In conclusion, we have 
molecular ly defined the three main chromosomal 
rearrangements implicated in the genesis of pleomorphic 
10 adenomas. 

Structural rearrangement of the 8qll-13 region are 
consistently found also in benign lipoblastomas . These 
tumors are considered to be unique variants of lipoma, 
occurring primarily in children before 3 years of age. Among 
15 the seven cases so far analyzed cytogenetically , rearrangem- 
ents involving 8qll-13 were observed in six cases (Mitelman 
(1994)). Similar to pleomorphic adenomas, the translocation 
partners of 8ql2 are variable. The similarities in 
chromosomal pattern between these two tumor types suggest 

2 0 that PLAG1 is the target of the 8qll-13 rearrangements also 

in lipoblastomas. 

We have also elucidated the molecular consequences of 
the t(3;8) translocation. The 8ql2 translocation breakpoints 
were invariably found in the 5 1 -non-coding region of PLAG1 
25 in all seven primary tumors tested. The breakpoints in the 
CTNNB1 gene were always found in the first intron. As a - 
result of the translocation, the coding sequences of PLAG1 
are brought under control of the 5 ■ regulatory sequences of 
the CTNKB1 gene, and vice versa, resulting in the activation 

3 0 of PLAG1 and down-regulation of CTNNB1. Based on their 

expression patterns in normal fetal and adult tissues, it is 
clear that the regulation of expression of the two genes is 
very different. The CTNNB1 gene is highly and ubiquitously 
expressed, whereas PLAG1 is developmentally regulated, with 
3 5 expression restricted to fetal tissues. It should be empha- 
sized that in adult tissues, PLAG1 is either not expressed 
or expressed at very low levels. In the adenomas, both the 
CTNNB1 7PLAG1 and the reciprocal PLAG1/CTNNB1 transcripts 
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were detected by RT-PCR; detection of the latter transcripts 
sometimes required three rounds of PGR, indicating that the 
expression levels are very low. Our findings represent the 
first example of reciprocal exchange of expression control 
5 elements in solid tumors. Since the coding sequences of both 
genes are invariably preserved, the molecular mechanism 
could be classified as promoter swapping, resulting in 
activation of PLAG1 and down-regulation of CTNNB1 
expression. 

10 Since the t(3;8) in pleomorphic adenomas invariably 

leads to activation of PLAG1, it is of interest to evaluate 
possible physiological implications. The PLAG1 gene encodes 
a protein with a deduced molecular weight of 56 kDa and a 
deduced pi of 8.56. Analysis of the open reading frame of 

15 PLAG1 reveals seven zinc fingers in the amino-terminal 

region. The carboxy-terminal region is rich in serine resi- 
dues. Furthermore, two potential nuclear localization sig- 
nals are present (residues 22-25 and 29-32) . Collectively, 
this suggests that the PLAG1 protein is a novel member of 

20 the large zinc finger gene family. Zinc finger motifs were 
originally identified as DNA binding structures in the RNA 
polymerase III transcription factor TFIIIA, which binds to 
the internal control region of the 5S RNA gene. TFIIIA-like 
zinc fingers are present in a variety of regulatory proteins 

25 found in higher and lower eukaryotes. This type of zinc 

finger motif consists of ^30 amino acids with two cysteine _ 
and two histidine residues (C2H2) that stabilise the domain 
by tetrahedrally coordinating a Zn 2 * ion. A region of ^12 
amino acids between the invariant cysteine-histidine pairs 

3 0 is characterized by scattered basic residues and several 
conserved hydrophobic residues. Most zinc finger genes 
studied so far encode polymerase II transcription factors. 
Apart from transcriptional modulation and control of RNA 
metabolism, chromatin packaging might also constitute an 

3 5 important activity through which zinc finger proteins exert 
their regulatory roles (El-Baradi & Pieler (1991)). It 
should also be noted that the presumed role of HMGIC .is also 
in chromatin modelling (Schoenmakers et al. (1995); Ashar et 
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al. (1995); Wolffe (1994)). The mammalian genome is known to 
contain a large number of C2H2 zinc finger genes ( Belief roid 
et al. (1989)) and the number of such genes implicated in 
cancer is growing steadily. 
5 The function of the serine rich carboxy-terminal part 

of PIAGl is unknown but it may have a regulatory function 
that can be controlled by serine/threonine kinases. If PIAGl 
encodes a DNA binding protein, as the presence of its zinc 
fingers suggests, the carboxy-terminal region might 

10 represent the transactivation domain.- At least four 

different primary sequence motifs that characterize the 
activation domains are identified thus far, i.e. acidic, 
glutamine-rich, proline-rich and serine/ threonine-rich. They 
likely represent regions that functionally interact with 

15 other proteins (Mitchell & Tjian (1989)). 

Activation of PLAG1 due to promoter substitution may 
lead to deregulation of genes normally controlled by PIAGl . 
If overexpression of PIAGl is indeed a major molecular 
consequence of the 8ql2 translocations than this could 

20 possibly be achieved also by other mechanisms, including for 
instance an increase in gene copy number, In this context it 
should be noted that a small subgroup of pleomorphic adeno- 
mas show trisomy 8, often as the sole anomaly (Mitelman 
(1994)). In addition, trisomy 8 is a common numerical 

25 abnormality found in several histological subtypes of 

malignant salivary gland tumors as well as in different, 
subtypes of leukemias and sarcomas (Mitelman (1994)). 

The preferential fusion partner of PIAGl is the CTNNB1 
gene, which encodes 6-catenin, a cytoplasmic protein of 

30 about 88 kDa (Gumbiner & McCrea (1996)). Its frequent 
involvement in pleomorphic adenomas as found here might 
point towards a critical role of CTNNB1 . Most likely, a role 
of CTNNB1 is to provide an active promoter in front of the 
PIAGl gene. However, since the observed promoter swapping 

3 5 between PIAG and CTNNB1 leads to down-regulation of CTNNB1 
expression, it may also lead to other physiological 
consequences, especially since B-catenin is a broad-range 
protein interface, as that has been implicated in highly 
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diverse processes, such as human colon cancer, epithelial 
cell adhesion, embryonal axis formation in Xenopus . and 
pattern formation in Drosophila (Peifer (1993)). The S- 
catenin protein has also been found as structural component 
5 of adherens junctions (AJs) (Peifer (1993); Kemler (1993); 
Gumbiner (1996)), which are multiprotein complexes assembled 
around Ca 2 *-regulated cell adhesion molecules (cadherins) . 
B-Catenin binds directly to cadherins and acts as a protein 
interface between cadherin and the cytoskeleton. The 

10 cadherin-B-catenin complex mediates cell adhesion, 

cytoskeletal anchoring, and signalling, which are important 
processes for regulation of cell growth and behaviour. It 
has been suggested that AJ complexes may play a role in the 
transmission of signals for contact inhibition (Peifer 

15 (1993)). Down-regulation of B-catenin, as occurs in 

pleomorphic adenomas, could interfere with this transmission 
and lead to growth deregulation. Since B-catenin levels 
apparently are important physiologically, it might be that 
down-regulation of CTNNB1 expression plays some role in 

20 pleomorphic adenoma development, possibly synergistically to 
PLAG1 . In conclusion, as activation of PLAG1 is observed in 
all pleomorphic adenomas with 8ql2 involvement tested, this 
seems to constitute a major factor in salivary gland tumor 
formation. The critical role of down-regulated levels of B- 

25 catenin remains to be established. 

Previous studies of HMGIC have shown that rearrange- 
ments of this gene preferentially occur in particular cyto- 
genetic subgroups of benign mesenchymal tumors, with the 
only exception so far being a subgroup of pleomorphic adeno- 

30 mas (Schoenmakers et al. (1995)). When considering the 

findings in the latter type of tumors, it should be noted 
that, although they are epithelial in origin, they often 
also show a varying degree of mesenchymal differentiation. 
This may be explained by the fact that pleomorphic adenomas 

3 5 are thought to originate from a pluripotent reserve cell in 
the terminal duct system which possesses the capacity to 
differentiate into both epithelial and myoepithelial cells 
(Evans & Cruickshank (1970)). The latter cells may function 



50 ' 

as facultative mesenchymal cells, and are responsible for 
the production of extracellular material, including myxo- 
chondroid stroma (Dardick et al. (1991)). An intriguing 
question in this context is of course whether HMGIC is 
5 preferentially affected in adenomas with a prominent stromal 
component while PliAGl is preferentially affected in tumors 
with little or no stroma. Histological re-examination of all 
but one of the adenomas with PLAG1 involvement in this 
study, revealed that all cases contained mesenchymal 

10 components, including both myxoid and chondroid tissues, 
suggesting that mesenchymal differentiation is not 
restricted to adenomas with HMGIC involvement. It should be 
noted that previous cytogenetic observations have already 
indicated that chromosome 12ql3-l5 and 8ql2 rearrangements 

15 constitute mutually exclusive pathways for the evolution of 
pleomorphic adenomas (Sandros et al. (1990)). 

The identification and cloning of a novel "benign 
oncogene" activated by chromosome translocations constitute 
a major step towards an increased understanding of the 

2 0 molecular pathogenesis of benign neoplastic growth. Our 

findings may lead to new insights into the genetic differen- 
ces between benign and malignant tumors. In addition, the 
present findings may have diagnostic implications. The fact 
that PLAG1 is not normally expressed in adult tissues, but 

25 activated in pleomorphic adenomas due to 8ql2 rearrange- 
ments, makes it a potentially useful tumor marker in the 
differential diagnosis of benign and malignant salivary 
gland tumors. On the basis of the fusion transcripts that 
are fonfted, highly specific diagnostic assays can be develo- 

30 ped, as will be illustrated in the following examples. 

FIGURE LEGENDS 
Figure 3 

A: Contig of 3 overlapping YACs (bold lines) , 27 cos- 

3 5 mids and 2 phages, containing 27 landmarks and spanning a 

300 kb DNA region on chromosome 8ql2. Contig elements are 
labelled or numbered and defined in the list below. Cosmid 
clones isolated from the arrayed chromosome 8-specific 
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cosmid library constructed at Los Alamos National Laboratory 
(LANL) (Wood et al. (1992)) are named after their unique 
microtiter plate addresses. #1 and #2 2 are genomic phage 
clones; #21 is a clone isolated from the non-arrayed LANL 
5 chromosome 8-specific cosmid library. The orientation of the 
contig on the long arm of chromosome 8 is given as well as 
the order of 27 landmarks. It should be noted that the 
contig is not scaled. Below the contig, the genomic 
-organization and the relative location of the PLAGl gene is 

10 given schematically, with exact sizes (bp) of its exons and 
estimated sizes (kb) of its introns. Noncoding sequences are 
represented as open boxes and coding sequences as black 
boxes. The relative positions of the translation initiation 
(ATG) and stop (TAG) codons in PLAGl are indicated. At the 

15 bottom of the figure, characteristics of the deduced protein 
encoded by PLAGl are given. Zinc fingers are labelled F1-F7. 
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(clone number, cosmid or phage clone, STS, accession number) 

30 

B: Mapping of the 8ql2 translocation breakpoint in an 
adenoma (CG644) with a t (3 ;8) (p21;ql2) . Cosmid CEM48 (white 
spots) was co-hybridized with an alpha-satellite probes one 
specific for chromosome 3 and the other for chromosome 8 
35 (black spots) . Hybridization signals were found on the 
normal 8, the der (3), and the der (8), indicating that 
CEM48 spans the t ( 3 ; 8 ) (p2 1 ;ql2 ) breakpoint. Chromosomes are 
stained in blue with DAPI. 

40 Figure 4 

A: cDNA and deduced amino acid sequence of PLAGl . The 
relative positions of exon/intron boundaries are indicated 
: by triangles (t) . The conserved C, F, L and H residues in 
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the zinc finger domains are underlined. Residues 22-25 and 
29-32 constitute two potential nuclear localization signals. 
A potential polyadenylation signal and the putative mRNA 
destabilizing ATTTA motifs in the 3'-noncoding region are 
5 underlined. The nucleotide sequence of the complete cDNA has 
been deposited at GenBank under accession number XXX. 

B: Alignment of the seven zinc-finger-like motifs found 
in the deduced amino acid sequence of PLAG1 relative to the 
C2H2 consensus motif. The canonical C, F, L and H residues 
10 are given in bold. 

Fi crura 5 

Illustrative example of the detection of DNA rearrange- 
ment in PLAG1 of a primary pleomorphic adenoma using Sout- 
15 hern blot analysis. DNA of pleomorphic adenoma CG3 68 (panel 

I) and control DNA isolated from normal lymphocytes (panel 

II) were digested with restriction endonuclease BamHI (B) , 
EcoRI (E) , Hindlll (H) , or PstI (P) , as indicated. A probe 
with specificity for exon 3 of PLAG1 (EM440) was used. The 

2 0 molecular weight markers are 23.1, 9.4, 6.5, 4.3, 2.3 and 
2.0 kb, respectively. 

Figure 6 

A: Detection of CTNNB 1 / PLAG 1 and PLAG 1/ CTNNB 1 fusion 
2 5 transcripts by RT-PCR in primary adenomas. I. CTNNB 1/ PLAG 1 
were detected using the RT-PCR protocol and primers descri- 
bed in detail in Methods. Primary tumors analyzed included 
CG368 (lane 1), CG588 (lane 2), CG644 (lane 3), CG682 (lane 
4), CG752 (lane 5), CG753 (lane 6), T9587 (lane 7), and 
30 CG580 (lane 8). II. PLAG 1 / CTNNB 1 fusion transcripts were 
detected similarly using the same samples as under M I'. 
Details of the primers used here are given in Methods Secti- 
on. PCR products are labelled A-D. 

B: Schematic representation of the nature and origin of 
35 CTNNB 1/ PLAG 1 and PLAG 1/ CTNNB 1 fusion transcripts in primary 
pleomorphic adenomas with t (3 ;8) (p21;ql2) . At the top of the 
figure, the exon/intron distribution of the PLAG1 gene is 
given, at the bottom, the exon/intron distribution for the 
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CTNNB1 gene. Positions of chromosome breakpoints are indica- 
ted by an arrow (N ) . Translation initiation sites are indi- 
cated by asterisks (*) and stop codons by triangles (t) . A- 
D: schematic exon compositions of hybrid transcripts, as 
5 established by 5 1 -RACE analysis. A: cDNA sequence junction 
between exon 1 of CTNNB1 and exons 3-5 of PLAG1 . B: cDNA 
sequence junction between exon 1 of CTNNB1 and exons 2-5 of 
PLAG1. C: cDNA sequence junction between exon 1 of PLAG1 and 
exons 2-16 of CTNNB1 . D: cDNA sequence junction between 
10 exons 1-2 of PLAG1 and exons 2-16 of CTNNB1 . 

Figure 7 

A: Northern blot analysis of the expression pattern of 
PLAG1 in normal human fetal tissues including brain (1), 
15 lung (2), liver (3), kidney (4) as well as adult tissues 
including heart (5), brain (6), placenta (7), lung (8), 
liver (9) , skeletal muscle (10) , kidney (11) , and pancreas 
(12) . 

B: Northern blot analysis of the expression pattern of 
2 0 the CTNNB1 gene in normal human fetal and adult tissues as 
described under "A". 

C: Detection of CTNNB1/PLAG1 transcripts in pleomorphic 
adenomas by Northern blot analysis using exon 1 of CTNNB1 as 
a molecular probe. Lane 1, RNA of CG644 (t(3;8)), lane 2, 
25 RNA of CG580 (t(8;15)), and lane 3, RNA of CG682 (ins 3p21 
(8ql2)). The CTNNB 1 /PLAG 1 fusion transcript is indicated. 

D: Using the same blot as under "C M , detection of 
CTNNB 1 / PLAG 1 transcripts using a probe with specificity for 
the 3» OTR of PLAG1 (probe KK64) . The CTNNB 1 / PLAG 1 trans- 
30 cript is indicated. 

EXAMPLE 3 

The discovery of PLAG1 made it possible to identify 
PLAGl-related genes. The possibility that the human genome 
35 contains such genes was raised by Southern blot data showing 
bands (weak hybridization signals) . In this example, the 
identification of another member of the PLAG gene family 
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that might be of pathogenetical relevance is described, i.e. 
the PLAG 2 gene. 

To isolate cDNA clones corresponding to other members 
of the PLAG1 gene family, a cDNA library in A. ZAP was used 
5 (Stratagene) . As molecular probe, the complete human PLAG1 
cDNA was used. Screening of the cDNA library was performed 
according to routine procedures using low stringency hybrid- 
ization conditions (Schoenmakers et al. (1994)). This 
resulted in the isolation of a number of overlapping cDNA 

10 clones presumably corresponding to another member of the 

PLAG family. From the inserts of these, a 2.7 kbp composite 
cDNA could be constructed that contained an open reading 
frame (1233 nucleotides) for a protein (411 amino acids) 
structurally similar to PLAG1 . This protein was designated 

15 PLAG 2 . The 2.7 kbp human PLAG 2 cDNA fragment started 17 6 
nucleotides upstream of the ATG start codon. 

The nucleotide sequence of the composite PLAG2 cDNA and 
the deduced amino acid sequence of the corresponding PLAG2 
protein are shown in Figure 8. Nucleotide sequences were 

2 0 determined according to the dideoxy chain termination method 
using the T7 polymerase sequencing kit of Pharmacia/LKB . DNA 
fragments, subcloned in the pBLUESCRIPT or pGEM-3Zf (+) 
vector, were sequenced using standard primers and primers 
synthesized based upon newly obtained sequences. Nucleotide 

2 5 sequence data were obtained from both strands and analyzed 

using the sequence analysis computer programs Genepro (Ri- . 
verside Scientific) , PC/Gene and Intelligenetics (IntelliGe- 
netics, Inc. ) . 

To' established that the isolated PLAG2 sequences cor- 

3 0 responded to expressed sequences, Northern blot analysis was 

performed according to routine procedures. Total RNA was 
extracted from various human tissues and cell lines using 
Trizol™ (GIBCO/BRL) . After precipitation, concentrations of 
RNA in the samples were estimated via spectroscopic determi- 
3 5 nation. 40 /ig of total RNA from each sample was run on 1% 
agarose gels containing 1 x MOPS (0.02 M MOPS (Sigma) pH 
7.0, 50 mM Na-acetate, 10 mM EDTA pH- 8.0), and 0.66 M -for- 
maldehyde (Sigma) in a 1 x MOPS buffer for 16 h at 4 V/cm. 
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After gel electrophoresis, RNA was transferred to Hybond N* 
filters (Amersham) using 10 x SSPE (20 x SSPE: 3.0 M NaCl, 
0.16 N NaH 2 P0 4 pH 7.4, 0.02 M EDTA) for standard capillary 
blotting. After completion of the transfers, nylon filters 
5 were irradiated with ultraviolet light using a Stratalinker 
(Stratagene) . Prehybridizations of the blots were carried 
out for 3 h at 42 °C in hybridization buffer consisting of 5 
x SSPE, 10 x Denhardt's, 2% SDS, 50% formamide and 100 /xg/ml 
herring sperm DNA. For hybridizations, blots were incubated 

10 overnight in the same buffer as described for the prehybrid- 
izations. Subsequently, blots were washed for 40 min at room 
temperature in 2 x SSC containing 0.05% SDS and for 40 min 
in 0.1 x SSC containing 0.1% SDS at 50 °C. Autoradiographs 
were obtained by exposing Kodak X-AR film at -80 °C with 

15 intensifying screen. 

In Northern blot analysis using the complete 2.7 kb 
PLAG2 cDNA as molecular probe and rather stringent hybrid- 
ization conditions, four distinct mRNAs were detected. These 
varied in size and included mRNAs of about 4.4 kb, 3.1 kb, 

2 0 2.9 kb, and 1.9 kb. The 3.1 and 2.9 kb transcripts were 
preferentially seen in fetal tissue, whereas the other 
transcripts were seen preferentially in adult tissues. These 
results established the expression of the PLAG2 sequences in 
a variety of cell types. Furthermore, the detection of 

2 5 various mRNAs with the PLAG2 cDNA probe suggests the exis- 

tence of various isoforms and raises the possibility that 
various PLAG2 protein isoforms may be produced. 

EXAMPLE' 4 

30 Detection of hybrid PLAG1 in salivary gland tumor cells. 

cDNA clones of the chromosome 3 -derived CTNNB1 gene 
were isolated and the nucleotide sequence thereof establis- 
hed. The nucleotide sequence data of a composite cDNA are 
shown in Fig. 9. The amino acid sequence of the CTNNB1- 

3 5 encoded protein was deduced. 

In 5 ' -RACE analysis of primary pleomorphic adenomas of 
the salivary glands, PLAG1 
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containing fusion trancripts were identified. 5 1 —RACE expe- 
riments were performed using total RNA of the primary, tu- 
mors. A primer specific for exon 5 (MV2) was used in the 
cDNA synthesis. Following ligation of an adaptor to the 
5 cDNA, PCR amplification was performed using, in the first 
round, an adaptor-specific primer and a PLAGl -specific 
primer corresponding to sequences of exon 5 (MVS) and, in a 
second round, nested adaptor-specific primer and PLAGl - 
specific primer corresponding to sequences of exon 4 (MV6) . 

10 Nucleotide sequence analysis of the PCR product revealed 

that the ectopic sequences were fused to the acceptor splice 
site of exon 3 of PLAGl. BLAST analysis revealed that they 
were identical to exon 1 sequences of CTNNB1, the gene for 
6-catenin, which has previously been assigned to chromosome 

15 3p21. 

EXAMPLE 5 

Diagnostic test for pleomorphic adenoma of the salivary 
glands 

20 Fine needle biopsies of patients having a pleomorphic 

adenoma of the salivary gland (with a chromosome 8ql2 
aberration others with a normal karyotype) were taken. From 
the material thus obtained total RNA was extracted using the 
standard TRIZOL™ LS protocol from GIBCO BRL as described in 

25 the manual of the manufacturer. This total RNA was used to 
prepare the first strand of cDNA using reverse transcriptase 
(GIBCO/BRL) and an oligo dT(17) primer containing an 
attached short additional nucleotide stretch. The sequence 
of the primer used is as described in Example 2, under point 

30 2.6 [AAG GAT CCG TCG ACA TC(T)17]. RNase H was subsequently 
used to remove the RNA from the synthesized DNA/RNA hybrid 
molecule. PCR was performed using a gene-specific primer 
(Example 2, point 2.6) and a primer complementary to the 
attached short additional nucleotide stretch. The thus 

3 5 obtained PCR product was analysed by gel electrophoresis. 
Fusion constructs were detected by comparing them with the 
background bands of normal cells of the same individual. 
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In an additional experiment, a second round of hemi- 
nested PCR was performed using one internal primer and the 
primer complementary to the short nucleotide stretch [AAG 
GAT CCG TCG ACA TC(T)17]. The sensitivity of the test was 
5 thus significantly improved. 

EXAMPLE 6 

Diagnostic test for uterine leiomyoma. 

10 Uterine leiomyoma is the most common pelvic tumor in 

women, occurring with an incidence up to 77% of women of 
reproductive age when tumors are counted after 2 millimeter 
serial sectioning of consecutive hysterectomy specimens. 
Often multiple leiomyomas are present, with estimates as 

15 high as an average of 6.5 tumors per uterus. Although most 
patients with these steroid-dependent tumors are asymptoma- 
tic, symptomatic leiomyomas can be associated with abnormal 
uterine bleeding, pelvic pain, urinary dysfunction, sponta- 
neous abortions, premature delivery and infertility. The 

2 0 high incidence of this benign smooth muscle tumor constitu- 
tes a major public health problem with the diagnosis of 
myomatous uterus leading to over 200,000 hysterectomies 
performed annually in the United States. Furthermore, non- 
random inactivation of the X chromosome has clearly demon- 

2 5 strated that uterine leiomyomas are monoclonal proliferati- 
ons. Although leiomyomata typically are benign tumors, in a 
very low percentage (< 0.1%) they might be able to metasta- 
size or even progress to malignancy. 

Besides a normal karyotype which is being found in 

30 approximately 70% of the cases investigated, several cytoge- 
netically abnormal subgroups, can be distinguished among 
leiomyomata. Leaving out of consideration the group which 
shows random changes, one of the largest cytogenetic sub- 
groups (comprising approximately 25% of the cytogenetically 

35 abnormal tumors) was first described in 1988 and is charac- 
terized by the involvement of 12ql5 and/or 14q23-24, mainly 
as t(12;14) (ql4-15;q23-24) . Another subgroup, with a similar 
incidence, contains deletions involving the long arm of 
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chromosome 7, with region q21-22 being the most probable 
commonly involved region. Another subset of leiomyomas is 
characterized by numerical aberrations, mainly trisomy 12. 
This trisomy is found in approximately 10% of the cytogene- 
5 tically abnormal leiomyomas. Furthermore, chromosome 6p21 — 
pter, has been found to be recurrently involved in roughly 
5% of the cases studied. Finally, a small percentage (ap- 
prox. 3.5%) of leiomyomata shows t(l;2) (p36;p24) . In uterine 
leiomyomata with chromosome 12ql4-15 aberrations, the HMGIC 

10 gene is affected and in those with chromosome 6p aberrati- 
ons, the HMGIY gene is affected. 

The observation that 70% of the uterine leiomyomata 
tested show a normal karyotype prompted studies to investi- 
gate whether activation of a PLAG gene could be found in 

15 uterine tumors. Therefore, Northern blot and PGR analysis 
were performed. 

Specimens of tumor material surgically removed from 
patients and classified by routine pathology as uterine 
leiomyoma, uterine leiomyosarcoma, fibro leiomyoma, or uteri- 

2 0 ne fibroma were used. From the material thus obtained, total 

RNA was extracted using the standard TRI20L™ LS protocol 
from GIBCO BRL as described in the manual of the manufactu- 
rer. This total RNA was used in Northern blot analysis, 
which revealed activation of PLAG1 in about 50% of the 
25 tumors tested, indicating that specific PLAG activation is 
also observed in these types of tumors. These results were . 
confirmed by PCR analysis. These results reveal the possibi- 
lities of using PLAG1 as a diagnostic marker on routine 
biopsied taken from patients. Since PLAG1 is implicated in 

3 0 this type of tumor, PLAG1 can be used as a target for the 

development of novel therapeutic strategies for uterine 
leiomyoma . 

EXAMPLE 7 

35 Diagnosis of lipoblastoma. 

Lipoblastomas are benign pediatric tumors that are 
assumed to result from proliferation of primitive adipocy- 
tes. These type of tumors often grow rapidly but there are 
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no reports that they metastasize. Because of the rapid 
growth and their histopathological features, lipoblastomas 
can mimic myxoid or well-differentiated liposarcoma which 
makes the diagnosis problematic. As a direct consequence, 
5 therefore, some of these tumors have been treated in an 
unnecessarily agressive manner. 

A recurrent rearrangement involving the long arm of 
chromosome 8 has been reported for lipoblastomas. Southern 
blot analysis detected a rearrangement in the PIAG1 gene 
10 opening the possibility of using PLAG1 as a diagnostic 
marker. 

A biopsy of a patient having a lipoblastoma was taken. 
From the material thus obtained total RNA was extracted 
using the standard TRIZOL™ LS protocol from GIBCO BRL as 
15 described in the manual of the manufacturer. This total RNA 
was used to prepare the first strand of cDNA using reverse 
transcriptase (GIBCO/BRL) and an oligo dT(17) primer contai- 
ning an attached short additional nucleotide stretch. The 
sequence of the primer used is as described in Example 2, 

2 0 under point 2.6. RNase H was subsequently used to remove the 

RNA from the synthesized DNA/RNA hybrid molecule. PCR was, 
performed using a gene-specific primer (Example 2, point 
2.6) and a primer complementary to the attached short addi- 
tional nucleotide stretch. The thus obtained PCR product was 
25 analysed by gel electrophoresis. Fusion constructs were 
detected by comparing them with the background bands of. 
normal cells of the same individual. 

In an additional experiment, a second round of hemi- 
nested PCR was performed using one internal primer and the 

3 0 primer complementary to the short nucleotide stretch. The 

sensitivity of the test was thus significantly improved. 

EXAMPLE 8 

PLAG2 as diagnostic marker. 
3 5 To evaluate the diagnostic potential of the PLAG2 gene, 

we have first identified on which human chromosome the PLAG2 
gene is located and subsequently established its subchromo- 
somal location. Such mapping studies could link the gene to 
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known tumor- specific chromosome anomalies* Using several 
different PLAG2 containing YACs and cosmids in FISH analy- 
sis, the PLAG2 gene was tentatively mapped to chromosome 
band 6q24, a chromosome segment implicated in various tumors 
5 such as for instance malignant salivary gland tumors. 

Malignant salivary gland tumors are a heterogeneous 
group of tumors comprising at least 17 different entities 
according to the most recent WHO classification (1991) . 
Because of the large number of tumor types and the wide 

10 morphological spectrum that these tumors show they constitu- 
te a well recognized diagnostic problem. The most frequent 
chromosome abnormalities so far observed among malignant 
salivary gland tumors are deletions or translocations invol- 
ving 6q. Most of the deletions so far encountered are 

15 terminal deletions with breakpoints located at 6q22-q25. 

However, there are also a few cases on record with intersti- 
al 6q deletions. The minimal common region lost in the 
majority of all cases is 6q24-qter. The 6q deletions are not 
restricted to certain types of malignant salivary gland 

20 tumors, but seem to occur in all major histologic subtypes, 
including adenoid cystic carcinoma, adenocarcinoma, undiffe- 
rentiated carcinoma, mucoepidermoid carcinoma and acinic 
cell carcinoma. The frequency of 6q rearrangements vary 
somewhat in different tumor types. In for example adenoid 

2 5 cystic carcinomas clonal 6q deletions or translocations have 

been found in nearly 50 % of the cases. The fact that the 6q 
deletions are often the sole karyotypic change, indicate 
that they are primary cytogenetic events of pathogenetic 
importance to malignant salivary gland tumors. In addition 
30 to 6q deletions a subgroup of adenoid cystic carcinomas are 
characterized by a t (6 ; 9 ) (q2 1-24 ;pl3-23 ) . This is a primary 
abnormality that is diagnostic for this tumor types. It 
should also be noted that in addition to malignant salivary 
gland tumors there are several other tumor types with recur- 

3 5 rent deletions of 6q, including e.g. various types of leuke- 

mias and lymphomas, malignant melanomas and neuroblastomas. 
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Involvement of PIAG2 in malignant salivary gland tumors 

This example is performed to test whether the different 
6q deletions commonly seen in different histologic subtypes 
of malignant salivary gland tumors involve the PLAG2 gene at 
5 6q24. s 

8 . 1 Material and methods 

FISH analysis using the PLAG2 containing CEPH mega-YACs 
798D8 and 921C2 was performed on metaphase chromosomes from 
10 two cases of malignant salivary gland tumors, one 

adenocarcinoma and one adenoid cystic carcinoma, with 6q 
deletions. The methods for cytogenetic analysis and FISH 
were the same as described for PLAG1 . 

15 8 , 2 Results 

Hybridization with both YACs on metaphases from the 
adenocarcinoma resulted in signals of the same intensity on 
both chromosome 6 homologs at the expected position. There 
was no evidence of deletion of PLAG2 signals in this case. 

2 0 In contrast, the adenoid cystic carcinoma contained a 

subpopulation of cells, which in addition to the signal from 
the normal chromosome 6 showed either no signal or a weak 
signal from the 6q-marker chromosome. Both YACs gave a 
similar hybridization pattern. 

25 

8 . 3 Conclusion 

Preliminary studies suggest that at least one of the 
breakpoints in the 6q deletions in malignant salivary gland 
tumors map in the vicinity of the PLAG2 locus. PLAG2 is 
30 therefore a candidate gene for the commonly ocurring 6q 
deletions in malignant salivary gland tumors. 

EXAMPLE 10 

Animal tumor models involving PLAG1 as tools in in vivo 

3 5 therapeutic drug testing 

On the basis of the acquired PLAG1 knowledge, animal 
; tumor models can be developed as tools for in vivo therapeu- 
tic drug testing. To achieve this objective (for instance 
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for pleomorphic adenoma of the salivary gland) , two approa- 
ches can be used, gene transfer (generation of transgenic 
animals) on the one hand and gene targeting technology 
(mimicking in vivo of a specific genetic aberration via 
5 homologous recombination in embryonic stem cells (ES cells) ) 
on the other. These technologies allow manipulation of the 
genetic constitution of complex living systems in specific 
and pre-designed ways. For extensive technical details, see 
B. Hogan, R. Beddington, F. Constant ini, and E. Lacy; In: 

10 Manipulating the mouse embryo, a laboratory manual. Cold 
Spring Harbor Press, 1994; ISBN 0-87969-384-3. 

To aim at the mutation of the PIAG1 gene, specifically 
in selected cell types and selected moments in time, the 
recently described Cre/LoxP system can be used (Gu, H. et 

15 al . Deletion of a DNA polymerase B gene segment in T cells 
using cell type-specific gene targeting. Science 265, 103- 
106, 1994). The Cre enzyme is a recombinase from bacteriop- 
hage PI whose physiological role is to separate phage geno- 
mes that become joined to one another during infection. To 

2 0 achieve so, Cre lines up short sequences of phage DNA, 

called loxP sites and removes the DNA between them, leaving 
one loxP site behind. This system has now been shown to be 
effective in mammalian cells in excising at high efficiency 
chromosomal DNA. Tissue-specific inactivation or mutation of 

2 5 a gene using this system can be obtained via tissue-specific 

expression of the Cre enzyme. 

As an example, the development of animal model systems 
for pleomorphic adenoma of the salivary glands using a 
member of the PLAG gene family will be outlined below, such 
30 that the models will be instrumental in in vivo testing of 
therapeutic drugs. 

Two approaches will be followed: 

a) in vivo induction of specific genetic aberrations as 
observed in human patients ((conditional) gene (isogenic) 

3 5 targeting approach) ; and 

b) introduction of DNA constructs representative for the 
genetic aberrations observed in patients (gene transfer 
approach) . 
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DNA constructs to be used in gene transfer will be 
generated on the basis of observations made in patients 
suffering from pleomorphic adenomas of the salivary glands 
as far as structure and expression control are concerned; 
5 e.g. PLAG1 fusion genes with various translocation partner 
genes, especially the preferential translocation partner 
gene CTNNB1 of chromosome 3, i.e. complete PLAG1 under 
control of a strong (salvary gland-specific) promoter. 

10 EXAMPLE 11 

The preparation of antibodies against PLAG-encoded proteins 

One type of suitable molecules for use in diagnosis and 
therapy are antibodies directed against the PLAG genes. Two 
approaches have been followed to develop them. Based upon 

15 the nucleotide sequence of the PLAG cDNAs, computer analysis 
was used to predict the location of antigenic determinants 
within the proteins. Synthetic peptides containing such 
determinants were prepared and rabbits were immunized with 
these. As an alternative approach, cDNA sequences can be, 

20 expressed in appropriate pro- and eukaryotic expression 

systems and the polypeptides synthesized in this way can be 
purified and used for immunization of mice. Cell lines 
obtained upon transfection or electroporation of constructs 
of the PLAG genes were helpful in characterizing the antibo- 

25 dies obtained. 

For the preparation of rabbit polyclonal antibodies 
directed against the PLAG 1 -encoded proteins, use was made of 
the following three commercially available peptides: 
(H-DLSEVRDTQKVPSGKR) 8-Multiple Antigen Peptide 

30 (H-FSSTSYAISIPEKEQPL) 8-MAP 
( H-QLPTQTQDLQDP ) 8 —MAP 

obtainable from Research Genetics Inc., Huntsville, AL, USA. 
The polyclonal antibodies were made according to standard 
techniques. 

35 With respect to PLAG 2 . suitable antibodies were genera- 

ted using hybrid proteins as antigens. These contained 
various portions of the PLAG2 coding region fused in frame 
to GST (glutathion-S-transf erase) . 
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EXAMPLE 12 

Inhibition of PLAG1 expression in tumor cells using antisen- 
se PLAG1 constructs. 

5 2^u Introduction 

PLAG1 can be used as target in novel therapeutic proto- 
cols for tumors in which it is implicated; e.g. pleomorphic 
adenomas of the salivary glands, uterine leiomyoma, etc. 
Cancer as a genetic disease is a logical target for gene 

10 therapy, either by replacing the missing or inactivated gene 
or by suppressing the activity of an unwanted gene. The high 
affinity of short DNA sequences for their target mRNA has 
indicated that antisense oligonucleotides constitute valua- 
ble reagents to specifically suppress the production of gene 

15 products, both in vitro and in vivo . Applications have been 
documented in various biomedical research areas, such as for 
instance cancer research, virology, and cardiovascular 
research. The in vitro activity, toxicity, and pharmacok- 
inetic data of antisense oligonucleotides are encouraging 

20 and in vivo animal experiments demonstrating suppression of 
neointimal formation seem very promising. Also in the field 
of cancer research promising results have been obtained. 
However, potential nonspecific effects of antisense oligonu- 
cleotides should be carefully considered when antisense 

25 agents are used to define biological functions of specific 
genes. 

Since expression of the PLAG1 gene is frequently acti- 
vated or strongly elevated in a wide variety of tumors and 
tumor cell lines, derived from tumors (rhabdomyosarcoma, 

30 uterine leiomyosarcoma, uterine leiomyoma, malignant 

salivary gland tumors) as well as leukemias and lymphomas, 
it was speculated that the PLAG1 -encoded protein might play 
a key role in transformation of cells. This example shows 
that expression of the PLAGl gene can be strongly reduced by 

35 expressing antisense PLAGl sequences and that reduction of 
PLAGl levels in tumor cells results in reversion of the 
transformed phenotype. Thus the expression or administration 
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of antisense molecules can be successfully applied 
therapeutically . 

2 . Materials and methods 
5 2.1 Tumor cell lines 

Tumor cell lines were generated from primary tumors as 
described by Kazmierczak et aL f Genes Chromosom. Cancer 
5:35-39, 1992. 

10 2.2 Culturing tumor cells 

Tumor cells were propagated in TC199 culture medium 
with Earle's salts, supplemented with 2 0% fetal bovine serum 
(GIBCO) , 200 IU/ml penicillin, and 200 microgram/ml strepto- 
mycin. 

15 

2.3 Transfection assay 

Transf ections were performed using various protocols, 
namely: 

1. The calcium phosphate precipitation procedure of Graham 

2 0 and Van der Eb ("A new technique for the assay of the infec- 
tivity of human adenovirus." Virology 52: 456-467, 1973.). 

2. Lipofection: Transf ections were carried out using liposo- 
me-mediated DNA transfer ( lipof ectamine, GIBCO/BRL) accor- 
ding to the guidelines of the manufacturer. 

25 

2.4 Antisense constructs . 
Sense and antisense constructs of the PLAG1 gene were 

obtained by inserting human PLAG1 cDNA sequences in both the 
sense and antisense orientation in expression vectors under 

30 transcriptional control of various promoter contexts, e.g. 
the long terminal repeat of Moloney murine leukemia virus, a 
CMV promoter, or the early promoter of SV4 0. For example, 
the CMV/PLAG1 plasmid was constructed by cloning a human 
PLAG1 cDNA fragment containing all coding sequences of human 

35 PLAG1 in pRC/CMV (Invitrogen) allowing expression under 
control of the human cytomegalovirus early promoter and 
enhancer, and selection for G418 resistance. 
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3 . Results 

3.1 Inhibition of PLAG1 expression 

Inhibition of PLAG1 expression was observed in tumor 
cells after induction of antisense PLAG1 expression in these 
5 tumor cells* Immunoprecipitation and Western blot analysis 
indicated a strong reduction of PLAG1 protein levels in the 
cells expressing antisense PLAG1 sequences. Therefore, this 
approach can be used therapeutically in tumors with involve- 
ment with PLAG1. 
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CLAIMS 



1. Gene being in an isolated form and implicated 
in tumorigenesis having the nucleotide sequence of any one 
of the strands of any one of the members of the PLAG subfa- 
mily of zinc finger protein genes or CTNNB1 genes, including 

5 modified versions thereof. 

2. Tumorigenesis gene (T-gene) as claimed in 
claim 1 having at least homology with the zinc finger 
domains that are typical for the PLAG1 gene the nucleotide 
sequence of which is depicted in figure 4A, or the 

10 complementary strand thereof, including modified or 
elongated versions of both strands. 

3. T-gene as claimed in claim 1 or 2 having essen- 
tially the nucleotide sequence of the PLAG1 gene as depicted 
in figure 4A, or the complementary strand thereof, including 

15 modified or elongated versions of both strands. 

4. T-gene as claimed in claim 1 or 2 having essen- 
tially the nucleotide sequence of the PLAG2 gene as depicted 
in figure 8A, or the complementary strand thereof, including 
modified or elongated versions of both strands. 

20 5. T-gene as claimed in claim 1 having essentially 

the nucleotide sequence of the CTNNB1 gene as depicted in 
figure 9, or the complementary strand thereof, including 
modified or elongated versions of both strands. 

6. T-gene as claimed in claims 1-5 for use as a 

2 5 starting point for designing suitable expression-modulating 
compounds or techniques for the treatment of non-physiolo- 
gical proliferation phenomena in human or animal. 

7. T-gene as claimed in any one of the claims 1-6 
for use as a starting point for designing suitable nucleo- 

30 tide probes for (clinically/medically) diagnosing cells 

having a non-physiological proliferative capacity as compa- 
red to wildtype cells. 

8. T-gene as claimed in claims 1-7, which gene is 
implicated in tumors selected from rhabdomyosarcoma, uterine 

35 leiomyosarcoma, uterine leiomyoma, malignant salivary gland 
• tumors, leukemias and lymphomas. 
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9. Protein encoded by a T-gene as claimed in 
claims 1-8 for use as a starting point for preparing suit- 
able antibodies for (clinically/medically) diagnosing cells 
having a non-physiological proliferative capacity as compa- 

5 red to wildtype cells . 

10. Derivatives of the T-gene as claimed in 
claim 1-8 for use in diagnosis and the preparation of thera- 
peutical compositions, wherein the derivatives are selected 
-from the group consisting of sense and anti-sense cDNA or 

10 fragments thereof, transcripts of the gene or practically 
usable fragments thereof, fragments of the gene or its 
complementary strand, proteins encoded by the gene or frag- 
ments thereof, antibodies directed to the gene, the cDNA, 
the transcript, the protein or the fragments thereof, as 

15 well as antibody fragments. 

11. In situ diagnostic method for diagnosing 
interphase and/or metaphase cells having a non-physiological 
proliferative capacity, comprising at least some of the 
following steps: 

20 a) designing a set of nucleotide probes based on 

the information obtainable from the nucleotide sequence of 
the T-gene as claimed in claim 1-5, wherein at least one of 
the probes is hybridisable to a region of the aberrant gene 
substantially mapping at the same locus as a corresponding 

25 region of the wildtype gene and/or the same or another probe 
is hybridisable to a region of the aberrant gene mapping at 
a different locus than a corresponding region of the 
wildtype gene; 

b) incubating one or more interphase or metaphase 
3 0 chromosomes or interphase or metaphase cells having a non- 
physiological proliferative capacity, with the probe (s) 
under hybridising conditions; and 

c) visualising the hybridisation between the 
probe (s) and the gene. 

35 12. Method of diagnosing cells having a non-physi- 

ological proliferative capacity, comprising at least some of 
the following steps: 
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a) taking a biopsy of a tumor to obtain cells to 
be diagnosed; 

b) isolating a suitable T-gene-related macromo- 
lecule therefrom; 

5 c) analysing the macromolecule thus obtained by 

comparison with a wildtype reference molecule preferably 
from the same individual. 

13. Method as claimed in claim 12, comprising at 
least some of the following steps: 
10 a) taking a biopsy of a tumor to obtain cells to 

be diagnosed; 

b) extracting total RNA thereof; 

c) preparing at least one first strand cDNA of the 
mRNA species in the total RNA extract, which cDNA comprises 

15 a suitable tail; 

d) performing a PCR and/ or RT-PCR using a PLAG 
gene specific primer and a tail-specific and/or partner- 
specif /nested primer in order to amplify PLAG gene specific 
cDNA • s ; 

2 0 e) separating the PCR products on a gel to obtain 

a pattern of bands; 

f) evaluating the presence of aberrant bands by 
comparison to wildtype bands, preferably originating from 
the same individual. 
25 14. Method as claimed in claim 12, comprising at 

least some of the following steps: 

a) taking a biopsy of a tumor to obtain cells to 
be diagnosed; 

b) isolating total protein therefrom; 

30 c) separating the total protein on a gel to obtain 

essentially individual bands and optionally trnasferring the 
bands to a Western blot; 

d) hybridising the bands thus obtained with anti- 
bodies directed against a part of the protein encoded by the 

3 5 remaining part of the T-gene and against a part of the 

protein encoded by the substitution part of the T-gene; 

e) visualising the antigen-antibody reactions and 
establishing the presence of aberrant bands by comparison 



with bands from wildtype proteins, preferably originating 
from the same individual, 

15. Method as claimed in claim 12, comprising at 
least some of the following steps: 
5 a) taking a biopsy of a tumor to obtain cells to 

be diagnosed; 

b) isolating total DNA therefrom; 

c) digesting the DNA with one or more so-called 
"rare cutter" restriction enzymes; 

10 d) separating the digest thus prepared on a gel to 

obtain a separation pattern; 

e) optionally transfering the separation pattern 
to a Southern blot; 

f) hybridising the separation pattern in the gel 
15 or on the blot with one or more informative probes under 

hybridising conditions; 

g) visualising the hybridisations and establishing 
the presence of aberrant bands by comparison to wildtype 
bands, preferably originating from the same individual. 

2 0 16. Method as claimed in any one of the claims 

10-15, wherein the cells having a non-physiological 
proliferative capacity are selected from the group 
consisting of the tumors pleomorphic adenomas of the 
salivary gland, lipoblastomas , uterine leiomyomas, and other 

25 benign tumors as well as various malignant tumors, including 
but not limited to sarcomas (e.g. rhabdomyosarcoma, 
malignant salivary gland tumors) and leukemias and lympho- 
mas • 

17. Anti-sense molecules of a T-gene as claimed in 
30 claims 1-8 for use in the treatment of diseases involving 

cells having a non-physiological proliferative capacity by 
modulating the expression of the gene. 

18. Expression inhibitors of the T-gene as claimed 
in claims 1-8 for use in the treatment of diseases involving 

35 cells having a non-physiological proliferative capacity. 

19. Diagnostic kit for performing the method as 
. claimed in claim 11, comprising a suitable set of labeled 

nucleotide probes . 
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20. Diagnostic kit for performing the method as 
claimed in claim 12, comprising a suitable set of labeled 
probes . 

21. Diagnostic kit for performing the method as 

5 claimed in claim 13, comprising a suitable set of labeled T- 
gene specific and tail specific PCR primers. 

22. Diagnostic kit for performing the method as 
claimed in claim 15, comprising a suitable set of labeled 
probes, and suitable rare cutting restriction enzymes. 

10 23. Method for isolating other T-gene based on the 

existence of a fusion gene, fusion transcript or fusion pro- 
tein in a tumor cell by using at least a part of a T-gene 
for designing molecular tools (probes, primers etc.). 

24. T-gene obtainable by the method of claim 23. 

15 25. T-gene as claimed in claim 24 for use in dia- 

gnostic or therapeutic methods. 
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ABSTRACT 



The present invention relates to a novel gene 
family identified as T-genes and being implicated in tumor- 
genesis having the nucleotide sequence of any one of the 
strands of any one of the members of the PIAG and CTNNB1 
gene families. The genes and their derivatives may be used 
in various diagnostic and therapeutic applications. 
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▼ 

AAAAATTATTAGAAATTCCC C GGGCGGTGTAGAGGCGGCQGACGGGCCGGAG 172 

GCTGCCTTOGCCGTATTKXX^CCCAGAATGCTra 258 
▼ 

CCTCAGATTGOCCAAAATGaGAAGGATTGGATTCCA<^ 344 
▼ 

TTTTTCCATCACTAATCAGTCCCTTTTTQCTTTC 4 30 

GAGTTGGGCAGTTCCCAAGTGTCCAACAAGAAGGCCTGGTTTAGGCTGCG 480 

ATGGCCACTGTGATTCCTGGTGATTTGTCAGAAGTAAGAG^ 566 

* .A T V I P CDLSEVRDTQKVPSGKRKRGETK 29 

ACCAAGAAAAAACTTTCCTTGCCAACTGTCTA^^ 652 
PRKNFPCQLCDKAFNSVEK LKVHSYSHTG 

GAGAGAGGCCCTACAAGTGCATACAACAAGACTGCACCAAG^ 

ERPYKCIQQDCTKAFVSKYKLQRHMATH 
TCTCCTGAGAAAACCCACAAGTGTAATTATTGTGAGAAAA!^^ 

SPEKTHKCNYCEKMFHRKDHLKNHLHTaD 
CCCTAACAAAGAGACGTTTAAGTGCGAAGAATGTGGCAAGAACT^ 

PNKET FKCEECGKNYNTKLGFKRHLALHA 
CCG<LAACAAGTGGTGACCTCACCTGTAAGGTATG*^^ 

ATSGDLTCKVCLQTFESTGVLLEILKSB 
GCAGGCAAGTCGTCTGGTOGGGTTAAAGAAAAAAAGCACCAGTO 

AGKSSGGVKEKKHQCEHCDRRFYTRKDVR 
GAGACACATGGTGGTGCACACTGGAAGAAAGGACT^ 

RaMVVHTGRKDFLCQYCAQRFGRKDHLTR 
GACATATGAAGAAGAGTCACAAICAAGAGCTTCT^ 

HMKKSHNQE LLKVKTEPVDFLDPFTCNV 
TCTGTCX^CTATAAJUUiACGAOCTCCTTCCGGTGATGTC 

SVPIKDELLPVMSLPSSELLSKPFTNTLQ 
GTTAAACCTC^rACAACACTCCATTTCAGTCCATGCA^ 

LNLYNTPFQSMQSSGSAHQMITTLPLGMT 
CATGCCCAATAGAJATGGACACTGTTC^TCCCTCTCACCJ^ 

CPIDKDTVHPSHHLSFKYPFSSTSYAIS 
ATTCCTGAAAAAGAACAOCCATTAAAOGOXAAATTGAGA^ 

* P EKEQPLKCEIESYLMELQGGVPSSSQD 
TTCTCAAGCATCGTCATCAICTAAGCTAGGGTTGGA1CCTCAGATTGCGTC 

s Q A S S SSKLGLDPQIGSLDDGAGDLSLSK 
AAAGCTCTATCTCCATCAGTX^CCCCTAAACACA^ 

S S I S ISDPLNTPALDFSQLFNFIPLNGP 
CCCTATAATCCTCrrATCAGTGGGGAGCCTTCGAATCAG^ 

p Y N P L SVGSLCMSYSQEEAHSSVSQLPTQ 
AACACAGGAICTTCAGGATCCTGCAAACACTATAGGG^ 

TQDLQOPANTICLGSLHSLSAAFTSSLST 
CAAGTACCACCCTCCCACGTTTCCATCAAGCTITTTCAGTAG 
STTLPRFBQATQ 

GAJTC t^GGAC ATOGA TTCATTAC AGAAATCTATCTCTAQCTC TOCCCTACATCACCATTTTTATTTTACTC^CTACTTTAAAACACTATAAAAATTTCTCCTTTTC 
^*T**^*^*^*^TTT^* ^* ^*^^ ^CAC T ATAAAAT^CAA ACT ACCTTTTAAACTGAGCTTTOGAACCA 1 1 I V. IV, 1 I\! ACTTAACTTTACCTCOCT Al 1 1 1V.11-L lUAT 
TCA CTOCC A AIT^ICA CATTTTAAGA l. Ullllllli *^ CATATACCAAAO C CATTATTACTACTAAACTTrrACAAATCCCArTTTCAAATTACTTTTAGATCT^ 
AA A k i l it -A 1 111 llTC TAATAACACTCGC TC T ACCTTTTGAC ATC TGGCTC ATT AAAAAAJ^^AjSCAATACAATCT AAATTCT ATAAAAACTTTCTCAAT AACTCA 
*^2Z23^*** TTTTCTT ACTAOCTTCT AAATGGATT AA T AA TCAACTQC TTCAAATCAATT AACACTCCACTTTCCCAAGAT AATAAATCTTTCTTACATACACC AT 
A ATT TCACATCACTATAi -rv ttjA ACACTC TCTCTTC TC TQGCT AAAA TA TTTOCCA TCTTT ATT ATGAOCCTTTAAGGAAAACAAACC CT AAACAC AAAOC ATC ACT 
AJUATA CCAAAAAC ACAi: ;i ff*TTAOCTCACATGQCA I i! i ll IV. l^^ ACTTAATACTTOCCCCTAAATTACTACACACGATATTTT^TCCTCTTICATOCTTCTTAA 
^^^^CT^TCTTTTCAT^XAA T AA TAGTAATACTCTA TQGt ATTOCCCTCTTCACAC TCGATATATAGG T ACATCTCTTT AffTCTTTTCCACCTTTCACATCC AACCC 
CTCCX7TCAACTCCACCCAOCAAT3TATTTTCA™ 

CTTAAAAACTGCmACTTTTTAC^AACTAAACTAAAACC ATA^CJCTTATATCAI 1 1VA. 1V* AATTTGATATC 1 1 1 1 IV. 1 1 1 C CC TTAAGAATC AAAAGC AGAAAACA 

ctcatcaaSac^acaaai^ 

T>C*raTAACAAACTAAACTAAAACAAACCTATT:^^ 

GACAACCAA CATAA1V. 1 11 T T AAAOC TCTAATTAA TATA TTTTTCTAACGCTCACTTCCGACATTAACCTAACAAACATATCTAT^ 
TTTAGCA^CTTTOCOTTOOC^T^ 

GCCCTATATGCA AAC T TU 1 . TTTTTCT ACGCCC ACTTA TC ACTGCAGA TTGCAATGTTTACC AAGAATTTCTAAAAATGAG TOC ACATTAC TGAAT ATAAT ACA 
rTATJTAAAATAT TTCC GACTACTATAATTTCTTC^W 

^~ T !5 C ^ ACTTCAt * C * ATCAAAC F ^ K n - AAQCAOC 1 * 1 ^ UL * ATTTTATCATGCTCCT ATATTTTTTTAATCCTTCCAGC AC TACTTTTT 
ATACT^ATCTAT^JAAATTTTATTATCAAAAArrACAT^TTAAA 
TTCAAATTCXrrCCT>OCTTTTCATTACC^CAATAT^^ 



^^*AX^I*AAAAAGGGAC TACCACCA^JJTjATTAACT AAAGCA TTTTGGAAA TGGOCAAT AGCGC CAT ATA TGT ATC T ATC TG TATGTG TC TCTCTCCTCTCTCT ATA 
WACACAC ACACATACATACTTA AAW: I IVA.CL H^A TCAAATTC AAA T AC A TCGACGCACATCTTC AGCCC ACC AGTGTT AAAATTTTGGAGTCTTAATTTTC AT 
GTCTACAC C IV. 1 1 11CCTV, 1 IHIACCCCC ACACTTGAAAT AAC ACTTCACACTAACACGCAATTCACCTAA 1 I IV. 1 1 1 1 1 AAAATTGAC TGT AGTGGTCACTAAAC 
CL 1 1 1 1 1 ^* C * C * A 3T TCTATTAAACATCACC C ACACTCOCTTATTTO 1 1 1 1 1 U IIL CTAGAAAA 

A ^^i^ TrCTATCTCJ ^ T ^ iul 1 1 ^ ATTCTCTAgT<^CACA>TCArrATCCITGAA^ 

^* *"^T^GGTTAAAAAAAAAAAAAAAAAAAAAAAAAAAGOGA TT AAC A TT AAATAAAAGT ACTTT ACAATTTGGGCC TC AGAC AAGAT ATTG AAC CTCATTC ACT 
TTC ACTTCCACATCT ATC TAC AAC TT ACG TCACCAAAC ACCGAACTTCAC TC TGGAACC ATC TTCCC ACTCT AACC AATGC T ATC CATTCATC TAT ACAAC T AC C TT 
T ATAOTATCCA TCACT CrTAAAACTTTC ATTTTAAAATCCTAT^ 

C 4 * * 1 1 **rTTC&TTI Ai, IU 11 II U 1V-AACCTCTCTATTTTTCCCTA 1 1 IV. 1 IPC IVA. 1 1 1A TTTTTCTTACTCA1 1 I V. IVA* AATATACTCATATA 1 IV. IV, 1 1 
AATTTC^ACACTACCOCTTTTTAAAAACCATATACTCACTCAAACATGACCCAC^ 
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Fig. 5 




Figure 6A 




Figure 7 



Fig. 8A 

PLAG2 cDNA and Open Read ing Frana (underlined 1 

AGGCTCAAGATAAAGACCTTAAAGATAACTTTGTGTGTCTCTCCCTTTCTAGGTATTTGCATAGGAATCAGAGGA 
GTTAATCTTGTCTCTTCTCACAGGTTTGAATCTTCAGACAAACTT 

G ATGTTCC CTGT C ACT CAGT AGG C AT ATGGCT ACCCATTCTC CCC AG A AATCTC ACC AGTGTG CTC ACTGTG AG A 
AGACGTTC^CCGGAJ^GACCACCTGAAAAACCACCTCCAGACCCA ra ^ 

AGGAQTCTGGGAAGAAGTACAACACCATGCTGGG CTATAAGAGGCACCTGGCCCTCCATGCGGCCAGCAGTGG^ 
ACQTCAgCT(?Tgg<^TCTGTGCCCTGGAGCTAGGGAGCACCGAGGTGCTACTGGACCAC OT 

AAG AG AJ\G C CCQ CT AO CGG AACC AAGG AAAAG AAGC ACC AGTG CG AC C A CTGTG AAAG ATG CTTCT AC AC CCGG A 

AQQATQTQCgAC<?CCACCTGCTGGTCCACACAGGATGCAAGGACTTCC T GTGCCAGTTCTGTGCCCAGAGATTTG 

GGCGCAAGGTTCACCTCACCCGGCATACCAAGAAGACCCACTCACA G GAGCTGATGAAAGAGAGCTTGCAGACCG 

GAGACCTTCTGAGCACCTTCCACACCATCTCGCCT TCATTCCAACTGAAGGCTGCTGC 

TA<?<M<?PTTCTCCCCAGAACGGGCT^ 

AACAAGCCGqCCAGCCrATGCAGCCGCrGCCAGAGTCCCTGGCCTC C CTCCACCCCTCGGTATCCCCTGGCTCTC 

CTCCGCCACqCOTCCCAATCACAAgTAC^ 

AAGCAgATACTAAAggTTTTTQCAATATC^ 

AGCTCAACCCAGGTTTTGATCTGGCTAAGGGAAATGCTGGT 

CTGTGAACCTAACAATACCTGCCTCTCTGGACCTG TCCCCCCTGTTGGGCTTCT 
CCCAAA^TACC^ITTOOGAATAGCACTCrTGCCCTGGGGCCTGGGGA^^ 

GGCAGCAGCAGCAAGAACCCCCACTTGCCATGGGCACTGTGAGC C TGGGCCAGCTCCCCCTGCCCCCCATCCCTC 

ATQTgTTCTCAQCT^CACTWCTCTGCcATCCT GCCTCATTTCCA 

TGTATTTTTCCGTATTCTGGAAGATGTTTTAAGAAGCAITTTAAATGTCAGTO 

AACGAGACTGGGACTATGGCTTATTCAGTGATGACTGGCTTGAGATGATAAGAGAATTCTCGAACTGC^TGTATT 

GTGCCAATCTGTCCTGAGTGTTCATGCITTGTACCAAATTTAATGAACGCGTGTTCT 

ATTGTCATAACCAACATCCAAAATGACGGCTGCTATATATAAGTGTTTGTCCATATGGGAATTTAAT 

C CATGG AT CC AT AATGTT AAACT AAAT AACTTT ATGGTGG CA CTG CCTAGT AAGGG AACT ATGG AAAGGTTTGG A 

TTTCTCCAAATCTGG^ 

T CI u l-i L rCAGGG ATTTTTCTA C CTTC AGGGTTGG ATGT AGTTT AGTT ACT ATTAC CAT AGCCAAC CTGT AGTTTT A 

CATATACATTTTCTTGTGGAGCAATAGAGTTCTCCATTTTACAGAAGC^ 

ACAAGATGCTCCAATGTGAGTTATGACrTCATTTATCT 

AGA AAAAAAAAAAAAATCACTGTGTAACCAGGTTAAG^ 

CTGACTTTAATATTGATTATATTTTTAACAGGGAATTTA 

TATTAAAACAAGAATTTTCCTTTCCCTCTGTCT 

TTGTTTGTAATCACCAATAAATAAGTGCATTTGTAA 

GTTTTACAATCTAACTATAATCTCCTGAATTTGGTATC 

TTTAAAGCTCCATTGTCTTATGTTTTTAGAGGCTTTTCCGTAAACATATATCTTACATATA 
TCTTGCAAATT 
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Fig. 9: Nucleotide sequence of cDNA of CTNNBl (B-catenin) 



1 aagcctctcg 

61 gcggagacgg 

121 cgaggacggt 

181 accatacaac 

241 gttggacatg 

3 01 ttacctggac 

3 61 aggcaatcct 

421 attttctcag 

481 tcgagctcag 

541 cccatctaca 



601 
661 
721 
781 
841 
901 
961 
1021 
1081 
1141 

1201 
1261 
1321 
1381 
1441 
1501 
1561 
1621 
1681 
1741 

1801 
1861 
1921 
1981 
2041 
2101 
2161 
2221 
2281 
2341 
2401 
2461 
2521 
2581 
264 1 
2701 
2761 



acagatgctg 
cacacgtgca 
taaggctgca 
gcgttctcct 
aacagctcgt 
ggccatcttt 
ttctgtgttg 
taaaatggca 
aaatgtitaaa 
agaaagcaag 

gacctatact 
ctgctctagt 
cctgacagat 
agatgctgca 
gggttcagat 
caataattat 
tactgtcctt 
tcat-ctgacc 
tggactacca 
tactgttgga 

gcagggtgcc 
ccgtacgtcc 
agttgaaggt 
tatcagagga 
catccaaaga 
agctattgaa 
aggtgtggcg 
ttacaagaaa 
ttggaatgag 
tcgccaggat 
gggtatggac 
agttgatggg 
tgacagcaat 
tctgaacttg 
tatctcagaa 
aaactttttg 
gactcaagaa 



gtctgtggca 
aggaaggtct 
cggactcccg 
tgttttgaaa 
gccatggaac 
tctggaatcc 
gaggaagagg 
tccttcactc 
agggtacgag 
cagtttgatg 

aaacatgcag 
atccctgaac 
gttatggtcc 
cagatggtgt 
tgtaccgctg 
aagtctggag 
ttttatgcca 
gtgcgtttag 
ttcttggcta 
ctcatcatac 

tacgaaaaac 
aataagccgg 
ccaagtcaac 
actaaacagg 
gatataaatg 
aagaacaaga 
cgggctggtg 
agccgacacc 
gttgtggtta 
ttgattcgaa 

attccacgac 
atgggtggga 
tgtaccggag 
ctaaatacca 
gtagctgcag 
gctgagggag 
acatatgcag 
cggctttcag 
actgctgatc 
gatcctagct 
cccatgatgg 
ctgccagatc 
cagctggcct 
cattgtgatt 
agtgcctgac 
ttctggtcct 
gtgaagaatg 



gcagcgt.tgg 
gaggagcagc 
cggcgggagg 
atccagcgtg 
cagacagaaa 
attctggtgc 
atgtggatac 
aagaacaagt 
ctgctatgtt 
ctgctcatcc 

ttgtaaactt 
tgacaaaact 
atcagctttc 
ctgctattgt 
ggaccttgca 
gcattcctgc 
ttacaactct 
ctggtgggct 
ttacgacaga 
tggctagtgg 

tactg^tggac 
ctattgtaga 
gtcttgttca 
aagggatgga 
tggtcacctg 
tgatggtctg 
acagggaaga 
aagaagcaga 
agctcttaca 
atcttgccct 

tagttcagtt 
cacagcagca 
cccttcacat 
ttccattgtt 
gggtcctctg 
ccacagctcc 
ctgctgfcttt 
ttgagctgac 
ttggacttga 
atcgttcttt 
aacatgagat 
tggggcatgc 
ggtttgatac 
ggcctgtaga 
acactaacca 
ttttggtcga 
cacaagaatg 



cccggccccg 
ttcagtcccc 
agcctgttcc 
gacaatggct 
agcggctgtt 
cactaccaca 
ctcccaagt.c 
agctgatatt 
ccctgagaca 
cactaatgtc 

gattaactat 
gctaaatgac 
taaaaaggaa 
acgtaccatg 
taacctttcc 
cctggtgaaa 
ccacaacctt 
gcagaaaatg 
ctgccttcaa 
tggaccccaa 

cacaagcaga 
agctggtgga 
gaactgtctt 
aggtctcctt 
tgcagctgga 
ccaagtgggt 
catcactgag 
gatggcccag 
cccaccatcc 
ttgtcccgca 

gcttgttcgt 
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ggagcggaga 
gccgagccgc 
cctgagggta 
actcaagctg 
agtcactggc 
gctccttctc 
ctgtatgagt 
gatggacagt 
ttagatgagg 
cagcgtttgg 

caagatgatg 
gaggaccagg 
gcttccagac 
cagaatacaa 
catcatcgtg 
atgcttggtt 
ttattacatc 
gttgccttgc 
attttagctt 
gctttagtaa 

gtgctgaagg 
atgcaagctt 
tggactctca 
gggactcttg 
attctttcta 
ggtatagagg 
cctgccatct 
aatgcagttc 
cactggcctc 
aatcatgcac 

gcacatcagg 
ggggtccgca 
gatgttcaca 
ctttattctc 
caggacaagg 
ttacttcact 
tctgaggaca 
ttcagaacag 
cagggagaac 
ggatatggcc 
caccctggtg 
atggatgggc 
atcatccttt 
gggctcgagg 
cctatgggaa 
tacaaatgga 
tggaatttag 



gcgaggggag 
caccgcaggt 
tttgaagrtat 
atttgatgga 
agcaacagtc 
tgagtggtaa 
gggaacaggg 
atgcaatgac 
gcatgcagat 
ctgaaccatc 

cagaacttgc 
tggtggttaa 
acgctatcat 
atgatgtaga 
agggcttact 
caccagtgga 
aagaaggagc 
tcaacaaaac 
atggcaacca 
atataatgag 

tgctatctgt 
taggacttca 
ggaatctttc 
ttcagcttct 
acctcacttg 
ctcttgtgcg 
gtgctcttcg 
gccttcacta 
tgataaaggc 
ctttgcgtga 

atacccagcg 
tggaagaaat 
accgaattgt 
ccattgaaaa 
aagctgcaga 
ctaggaatga 
agccacaaga 
agccaatggc 
cccttggata 
aggatgcctt 
ctgactatcc 
tgcctccagg 
agctgtattg 
ggtgggptgg 
caattgaagt 
ttttgggagt 
caaaccctag 
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STSs used to generate the 300 kb cosmid contig mapping 
at chromosome 8q1 2 and encompassing PIAG1 



STS CH129 

GAATTCTAAAACCATTTATAAATCATACTGAATCCCAGAACAATATATTTTAAACAACTTAA 

AAAAAAGAACAAAATAAAATAGCAAAACATTTTTAAGAGTGTAGATTCTTTGAAATTAAAGG 
ACATACTTACCCTGTAGT 



STS CH280 

GAATTCTTGCACCGGTTTTTTCTTATCAGTGTGGGCTGATGTTCCATTAACTGTGGTGTAAT 

TTGAGTATAGTCACTGACTGATTCTAGATATTTTCAGAGGGTCAAGACTTTTTCTAAGACCT 

TTATATGTGGTTGAATTCTTGTTCTTGGTTTCACAGAAGGTATATTAGCAAAGCATTTTTGG 
TGTTGAAGCTTGGTCTGTGATCTAGT 



STS CH33 

GAATTCGTTTTTATTTGACAAGCACATGAAGCCTTATCAGACGGAGGCCTCAATCCTTTGGC 
TGGGGTTTATAAGCAGGTAGCGCTAGACCTTCCCATTCTACATAAGCTGATGGGCACGGTAA 
TAGCTGGGGGTTTTCTCACAAGTCAAAGACAAATTGTCTGTTTTCAAGCGTGTGAAACAGTT 
wAAWACGTTTGAGGTCTCTCTCTTGCTTCATAGGCCATCTTGGCTCAGACATTCTACAGMCA 



STS EMI 56 

TCTGAGCAACAAGAGCGAAACTCCATCTCAAAATATATATATATATAGGTAATTGTTGTCAT 
TAATATTAATGTAGTAGCAGCAGCAACAGTCATGGTAGCAATATTGCTCTATTTGGGAGGCA 
ACTT AT AATT ATT AACTGT G G AAT ATCTTTG AAAAATGTTTTTNGC AGAMGTT ATGTTCCC A 
TTCCTGACTGGMGCTCATTATAAATACCCATCTTCTCTGAATAGCGCAAGGACTTTTGAAAA 
AGTGTTCTGAGTAAAC 



STS EMI 95 

ACAATCAATTTTAGAAAGTAATCATTTCATTACCCCAAACTGAAACCCTGTACCTGTTAGCA 
CTCACTCCCCTTTTCATTTTACTTTTTATTTATTTATTTTT^ 

GCCCNGGCNVCAGTGCAGTGGCACAaTCTCAACTCACTGCAACCTCTGCCTGCCAGGGTCAA 

GTGATTCTTGTGCCTCAGAGTCCCAAGTACCTGGGATTACAGGCATAAGCCACCACGCCTGG 

CTAAATTTTGTATTTTCAGTAGTGACGGGGTTTCACCATGTTGGCCAGGCTGTCTCAAACTG ' 

CTGACCTCAGGTAATCCACCCTCCTCAGCCTCCCAGAGTTCTGGGATTACAGGCGTGACACC 

GTGCCTGGCTCATTTTATTTTTTAGAGATGAGATCTCACTCTKWTTGCCCAGGCTTCAGTGC 

ATTGGCGTCATGATGGCTCACTGCAGGCTTCAGCTCCTGGCCTCAAaGCATCCTTCCGCCTCA 



STS EM208 

CTAGGCGACAGAGCAAGACTCTGTCTCAARGAAAAAaAAAAAaVRAAAAAAATTACCAAAAC 

TGACTACAGAAAAVHGVARGGT^GAATAGCCTTACATTTGGVAAATAATTTTTATTTATAAT 

TAAAGATATTTTTATAAAAVVTACTCTAGGCCCATAAGGCTTCACAGGTTAATTGTATTAAA 

TATTTAAGGAAA AAATA ATACCAATCTTATTCATAGTCTTTCAGAAAATAGAGGCGTATCCA 

TTTTTCTAACTCATTTTAAGAAATAGCATCATTCTAATATCAAAAGCAAACAAGGMCAT^ 

AAAGAAGAAGGGGAAGAAGGAAGAGGAAGAGGAGGAGAAAGGGAAGCAGGAGATGGAGAAGA 

AGGAAGCCAGGTACAGTGCAATATTTCTCATGAACATAAACACAATTTTTAAAAAGTATTAM 

CAGGCTGGGCTTGGTGGCTCTCACCCGTAATCCCAGCMCTTTGGAAGGCCCAAGGCGGGTGG 
GHCACAAGGTCAGGGGTTCGAG 



Figure 1 0 



STS EM216 

TACTRACTGCTGTGCAGTTBTCcTGCAGTCAGTTCCAGAGGTCATTTCTAACGTTGCACTAT 
GGGKCTATTTAATAGGTTTCCTAAAGAACAAACATATCTCTTTAbAGTTACTCAGAGGGTAC 
ACAATGATGATGTCACACAATTAATTACCTATTAAGACTGAAATCCAGCAATGCATAGKGTG 
TGGACTTTACGCACATCCAGAAAAAGTTCTAGCACAAATTGTTTTHGTMTYATATATTTCAG 
AAGCCATAGAAACACTATTAAAGCCCTCCCTAATCACTTAGGGATGCAAAaTCAATAT 



STS EM317 

GACCAACAAAGGCACACAAAGATTGGTTGCTTTCTGAAGAATCTAAAAATGGCATTGGGTAT 
AGGAGTTGGGGAAGCAAGTTGTATAGGCACCTACACTTAAGATAATTTGTCAATTATACAAA 
TAATTTTTAAAGTTTAAGCCCCTTTCTGACATGACACGTCCATGGGTCCTTCACCCTTYTtK 
KTCTCCTSCAGAGCTCCAGTCTGCCYYTTYTTKSCTCTGAGCTCCAAAAMCAGTGAWTCCCC 
TGAAGTTACCTAGMCCCMCCATACAGTTTGTGACTCCCTAWMCcGGGGGTACCyTCCCATGY 
CTGGCTAATAyTGABTYTTGTDTACCGTGGCTTCTGTGTTACTACATTTGTTTTARTGGAAT 
TWATwAArGGGAAGCCTATCAA 



STS EM416 

GAGCAACTGAaCDNAGATTGGGTGAGGTAAGATGTGGGCTGCACAGGTGAGGCTGGAGAGGT 
GGGGAGTGCGTCCCAGTCGGGGGAGAAGAAGAAAAGGGCAGACTAGGGTAGAAATGCTTATW 
ACTcCTGTGACTGGAGCTGATGGTGTCTTAAGGAAAGTGGTGGGAAGGGAGGVCTGCAGAAA 
GGCAAGGCTGGAGTCGACTGAAGGCTGGAGAGCCACTGCTTTAACAAGTGTAMCTGGAGATG 
GAAGGGGCTGCAGGACAGGTCACTCAGCCAGTKGTGTGGARGCAATCTCACC 



STS EM443 

TTGATATTTGTTCTAACTCCACATTAACTATTGACAAATACTGTAAATTGTAGCTACCATCT 
GTTACGTAGCTAGCAGGTACCCTAACAGCAATGGGTCAGCTTTTGAGTAGCGTTTCAACCAT 
GTTACCTCGAGTACGGTGTGGTGAGGCCAGACGCAGATGGAGAGAAAGAAACAGAATCGAGC 
ATTTCCATTTTGTTTTGCTCACAGTCCCCAGGGGCAAACACAGCACAGCCTACAGGACCATG 
AAGGGGAGCACTGGGGTCACTCATGAAGCAGGGAGGTCGGGCCAGTGGTGGGGGgCCTTTAT 
GTGTTTTCCTCAGGAAGGAATGGGCAAGGCAGGGTAAGCATGTTCAGGACTGGTTAATTTGA 
ATAACTTCAGGGGGgCTCTAGGGCCTGgRGGCTGCCCCTGGTTGTCTGGTACCYgGSCCTG 



STS EM4 6 

ATATCAATCTTGGGTCTATGTATGTTTTTGCTTTTCCCcAGTGTTCCAGGCATGATGCTAAG 
GATATAGTGGATGATGAAATATATGCTTGCTGAATATGGGAATAAGAATTATTTTATGATCA 
GAHTTTTTTTTTTTGAGATGGAGTCTCGCTCTGTCAcNMaGGCTVGTGTGCAGTGGCATGAT 
CTCAGCTCACWGCAACCTCTGVCTCCTGGGTTCAAGTGATT 



STS EM47 

GTAGAGACACACTAGGCATGCACAGACCAGTGCAGAATGAACAATATTTGTTACATGTGTAG 

TTCTTTATGGTTTACAAAACTCTCCCAGCCATTATCTTCTTTCAGCCTTATAAAAGACAGAG 

CATATTTTATTATCCTCATTTACCTWHTCTAGTAAGGCATTTTTTCTTTTTTTC r ^ 

GATATAAGGCTTAGGAAAAAAGTGAATACTACGATAAATGAATACTAGGAAAAGACATCACA 

ATCACAAATTTATTAATATCAGAAAACAGDTTTTAAGAATAA 



Figure 3.0 (continued) 



STS END2 

TAATTTATCACTACGGAATTCTGTGCAGTGAGATCAAAGAGCTGTGTATGCCCATAATGTGA 
TTTTACAGCCATTTTGTAAAAACTGTAAAATACCTTAATATTCAATTTGGCTTAAGGTACAT 
TGAGGACTTCTGGTTGAAAATTACAGAGTGGTGAAGATTC 



Known STSs 
PENK 
D8S285 
MOS 



STSs part of PIAG1 

EM265 
KK64 

KK63/EM209 
KK55/CH283 
EM224 
EM387 



Figure 10 (continued) 
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